INTRODUCTION


A large number of ratios showing the proportions of crossing over that 
occur between different pairs of genes have been determined experimen- 
tally for a particular organism, Drosophila melanogaster, and published 
by MORGAN and his co-workers. These ratios show among themselves
a number of extraordinary relations, which demand explanation from some 
unified point of view. The explanation proposed by the workers on 
Drosophila, the so-called “Linear Theory,” is that the genes are
arranged in the chromosomes in serial order, and that crossing over occurs 
by the breaking of these linear series, and exchange of parts by the two 
chromosomes at the time they pair. 
The present author (JENNINGS 1918) examined the numerical relations
involved in a theory of crossing over proposed by GOLDSCHMIDT (1917) and
showed that these are inconsistent with the observed relations. I then 
undertook a similar study of the numerical relations necessarily implied 
by the “linear” theory. At the same time a similar study was carried out 
by HALDANE, who published in 1919 a paper in which some of the mathe- 
matical relations required by the linear theory were set forth. I therefore 
did not at that time complete nor publish my own study. HALDANE’s 
paper is, however, incomplete; it does not bring out many of the most 
important relations demanded by the linear theory, nor does it portray 
the complex system of relations presented by the observed ratios; so 
that its author did not enter upon a full comparison of the required and 
the actual conditions. 

The present paper therefore attempts to analyze with some thorough- 
ness the conditions required by the linear theory, in order to see how far 
the ratios and relations found in nature are consistent with and demanded 
by them. If the linear theory is to be looked upon as a correct account 
of the conditions in nature, certainly a full exposition of the mathematical 
relations which it involves is needed. And such an exposition and com- 
parison will be found to furnish an illuminating, perhaps crucial, test of 
the theory. As will appear, there is a complex system of relations of a 
really astonishing character shown by the known ratios, and any general 
theory is almost bound to betray its truth or falsity when confronted 
with this system. Our analysis will lead us to deal incidentally also with 
the other theories of crossing over. 

The time appears ripe for such an examination. The recent extensive
study of the characters connected with the second chromosome of
Drosophila, by BRIDGES and MORGAN (1919), and the still more recent
summary for the three large linkage groups of Drosophila by BRIDGES (1921),
have, with the earlier paper on the first-chromosome group by MORGAN
and BRIDGES (1916), and the many papers of more limited scope, by
MORGAN and his co-workers, made available a mass of most valuable data
for such an examination.


THE SYSTEM OF RELATIONS FOR WHICH A THEORY IS REQUIRED 
TO ACCOUNT 


Crossing over is of course a phenomenon of experimental breeding; 
it may be dealt with quite independently of any theory of the arrangement 
of genes or of their relations to the chromosome. So considered, it consists 
in the following: When to form an individual there unite two gametes, 
one carrying two genes AB that follow the same chromosome, the other
gamete carrying two alternative genes ab, we find that the gametes later 
formed by this new individual have mainly these same combinations, 
AB or ab. But in a certain proportion of the gametes we find the new 
combinations Ab and aB, so that A has “crossed over” to join with b,
and a to join with B. The proportion of all cases in which these new
combinations are found is called the crossover ratio,¹ and it varies for
different pairs of genes from less than one-half of one percent to about 
fifty percent. 

It is the fact that the ratios for different pairs of genes differ, although 
there is a relative constancy in the ratio for any given pair, and the fact 
that these ratios when compared show a number of striking and peculiar 
interrelations, that yield the basis for theories of the matter. These 
interrelations may be illustrated as follows: 

Among the genes connected with the first or X chromosome of Drosoph- 
ila, that have been most fully studied, are the nine given in the following 
table. If we examine the crossover ratios of each of these with all the 
others (as given by MORGAN and BRIDGES 1916), we find that they yield
the following table, in which the order is simply alphabetical. (The 
ratios are given as percentages.) 


TABLE 1


Table of best-known crossover ratios in chromosome I of Drosophila. Genes arranged alphabetically.
(From MORGAN and BRIDGES 1916).


            |   Bar | Bifid |  Club | Miniature | Rudimentary | Sable | Vermilion | White | Yellow
Bar         |  .... |     ? |     ? |      20.5 |         2.3 |  13.8 |      23.9 |  43.6 |   47.9
Bifid       |     ? |  .... |     ? |      30.6 |        42.7 |     ? |      31.1 |   5.3 |    5.5
Club        |     ? |     ? |  .... |         ? |           ? |     ? |      18.8 |  14.3 |   17.7
Miniature   |  20.5 |  30.6 |     ? |      .... |        17.9 |   6.7 |       3.1 |  33.2 |   34.3
Rudimentary |   2.3 |  42.7 |     ? |      17.9 |        .... |  14.3 |      24.1 |  42.4 |   42.9
Sable       |  13.8 |     ? |     ? |       6.7 |        14.3 |  .... |      10.1 |  41.2 |   42.9
Vermilion   |  23.9 |  31.1 |  18.8 |       3.1 |        24.1 |  10.1 |      .... |  30.5 |   34.5
White       |  43.6 |   5.3 |  14.3 |      33.2 |        42.4 |  41.2 |      30.5 |  .... |    1.1
Yellow      |  47.9 |   5.5 |  17.7 |      34.3 |        42.9 |  42.9 |      34.5 |   1.1 |   ....








In this merely alphabetical table, one cannot fail to notice certain 
peculiar features. The ratios of white and of yellow with the other genes, 


¹ The above is the sense in which the terms crossover and crossover ratio have hitherto been
used in the literature. I understand that the workers on Drosophila are considering the
substitution of “recombination” and “recombination ratio” for these terms, employing the term
“crossover” in another sense.
for example, are parallel throughout with the values for yellow a little
above those for white. A similar relation, though less completely carried 
out, is seen on comparing the ratios for miniature and vermilion, or those 
for bar and rudimentary. Further it will be observed that the ratios for 
certain genes with the rest run higher than for certain others. Thus, for 
miniature the highest ratio found is 34.3, and the average is 20.9, while 
for yellow the highest is 47.9 and the average is 28.35. These and other 
relations discoverable by study suggest that there is some system in these 
ratios. It appears worth while to try arranging them in order with 
respect to particular genes, allowing the rest to take the places into which 
this order forces them, in order to see if any system is thus revealed. We 
may select for this purpose the gene having the lowest mean ratios (minia- 
ture) or that showing the highest (yellow); or any other. Various selec- 
tions give results of interest; that obtained by selecting as the basic gene 
the one with the highest mean ratios (yellow) gives however the most 
illuminating results. When we so arrange the other genes as to give the 
ratios with yellow in order from lowest to highest, letting the remaining 
ratios fall where they must, we obtain table 2. 
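
The averaging and reordering just described can be sketched in a few lines of Python. This is only an illustration: the dictionaries below copy the ratios with yellow and with miniature from table 1 (pairs marked "?" are omitted), and the function names are hypothetical.

```python
# Crossover ratios (percent) copied from table 1; "?" entries are left out.
YELLOW = {"white": 1.1, "bifid": 5.5, "club": 17.7, "vermilion": 34.5,
          "miniature": 34.3, "sable": 42.9, "rudimentary": 42.9, "bar": 47.9}
MINIATURE = {"bar": 20.5, "bifid": 30.6, "rudimentary": 17.9, "sable": 6.7,
             "vermilion": 3.1, "white": 33.2, "yellow": 34.3}


def mean_ratio(ratios):
    """Average of the known ratios of one gene with all the others."""
    return sum(ratios.values()) / len(ratios)


def order_by_ratio(ratios):
    """Genes sorted from lowest to highest ratio with the basic gene.

    The sort is stable, so genes with equal ratios keep their listed order.
    """
    return sorted(ratios, key=ratios.get)


print(f"mean for yellow:    {mean_ratio(YELLOW):.2f}")
print(f"mean for miniature: {mean_ratio(MINIATURE):.2f}")
print(order_by_ratio(YELLOW))
```

A purely numerical sort places miniature (34.3) just before vermilion (34.5); the two ratios differ by only 0.2 percent, so the sort alone does not settle which of the two should come first.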


TABLE 2


Table of best-known crossover ratios among genes of chromosome I of Drosophila, arranged in series
with reference to that gene having the highest mean ratios (yellow).*


            | Yellow | White | Bifid |  Club | Vermilion | Miniature | Sable | Rudimentary |   Bar
Yellow      |   .... |   1.1 |   5.5 |  17.7 |      34.5 |      34.3 |  42.9 |        42.9 |  47.9
White       |    1.1 |  .... |   5.3 |  14.3 |      30.5 |      33.2 |  41.2 |        42.4 |  43.6
Bifid       |    5.5 |   5.3 |  .... |     ? |      31.1 |      30.6 |     ? |        42.7 |     ?
Club        |   17.7 |  14.3 |     ? |  .... |      18.8 |         ? |     ? |           ? |     ?
Vermilion   |   34.5 |  30.5 |  31.1 |  18.8 |      .... |       3.1 |  10.1 |        24.1 |  23.9
Miniature   |   34.3 |  33.2 |  30.6 |     ? |       3.1 |      .... |   6.7 |        17.9 |  20.5
Sable       |   42.9 |  41.2 |     ? |     ? |      10.1 |       6.7 |  .... |        14.3 |  13.8
Rudimentary |   42.9 |  42.4 |  42.7 |     ? |      24.1 |      17.9 |  14.3 |        .... |   2.3
Bar         |   47.9 |  43.6 |     ? |     ? |      23.9 |      20.5 |  13.8 |         2.3 |  ....





























If this table be examined, by following the columns from above down- 
ward, or the rows from left to right, it is at once evident that, with slight 
irregularities, the entire set falls into a system, although we have arranged 


* In the case of miniature (34.3) and vermilion (34.5) the ratios with yellow are so nearly 
identical that the difference (0.2 percent) lies far within the probable error, and there is conse- 
quently no direct clue to the order in which these two genes should be placed. I have employed 
the order which relates them most naturally to the discoverable system of relations. If they be 
interchanged the irregularities of the system will be slightly increased. 
it in order only with reference to the gene yellow. The ratios with white
form a series parallel to those with yellow, but each a little lower in value,
while the ratios with bar form a similar series, but in the reverse order of
magnitude. For the other genes the ratios fall into two partial series,— 
one with magnitudes increasing as in that for yellow, the other in the 
reverse order. A similar table will be given if we arrange the ratios 
between the best-known genes of the second chromosome of Drosophila 
in a table in their order of magnitude with relation to the gene having 
the highest mean ratio. According to BRIDGES and MORGAN (1919),
the eight best-known genes of the second chromosome are black, curved,
dachs, purple, speck, star, streak and vestigial. If the reader will take
the ratios given by BRIDGES and MORGAN and arrange them in series of
magnitudes with relation to star or speck, he will obtain a table showing
the same peculiarities as does table 2.

Careful examination of such a table reveals general relations, which 
we will formulate under the following numbered basic propositions: 


Basic propositions 


1. The ratios vary from below one-half percent to about fifty percent. 

(In Drosophila, ratios as low as four-tenths percent are recorded,
and I am informed by Dr. MORGAN that ratios at least as low as
two-tenths percent occur.)

2. If two genes show with one another a low crossover ratio, they 
show nearly the same ratio, low or high, with any other gene. (Thus 
yellow and white respectively show with rudimentary the ratios 42.9 
and 42.4; with club the ratios 17.7 and 14.3, with bifid 5.5 and 5.3, etc.) 

3. If two genes have together a high ratio (as do yellow and bar), 
their ratios with particular other genes are very diverse. 

4. The greater the ratio of two genes together, on the whole the more 
diverse their ratios with a given other gene. 

5. If any two genes, A and B, have together a certain ratio, and
B has with a third gene C a certain ratio, then the ratio of the first (A)
with the third (C) is either a little less than the sum of the ratios A-B and
B-C, or a little more than their difference.

Thus, the ratio of yellow to club is 17.7; of club to vermilion 18.8. 
Their sum is 36.5, while the ratio of yellow to vermilion is 34.5. The 
ratio yellow-bifid is 5.5; of bifid-miniature 30.6; their sum is 36.1, while 
the ratio of yellow to miniature is 34.3. 

Again, the ratio white-vermilion is 30.5; vermilion-club 18.8; their 
difference is 11.7, while the ratio white-club is 14.3. 
6. If for the gene having the highest mean ratios, as yellow, we
arrange in ascending order the series of its ratios with the other genes,— 
allowing the position of all the genes to be thus determined,—then all 
the ratios fall into a systematic table (as table 2), having the following 
characteristics: 

7. Genes having a low ratio with this basic gene (e.g., white) show 
with the others a similar ascending series of ratios, but with each ratio 
slightly less. 

8. Genes having a higher ratio with this basic gene show a similar 
ascending series of ratios with genes having a still higher ratio than its 
own with the basic gene (though with the ratios much decreased in value); 

9. But a reverse series with all the genes having a lower ratio than 
its own with this basic gene. (Cf. the ratios of vermilion with others.) 

10. In a table so arranged, if in any column (or row) we take for a 
gene A its two ratios with B and with C, these lying on the opposite sides 
of the diagonal, then the sum of these two ratios gives approximately,
but somewhat more than, the ratio between B and C. 

Thus if A is vermilion, B is white and C is miniature, then ratio A-B is 
30.5, ratio A-C is 3.1; their sum is 33.6, which is a little more than ratio 
B-C, which is 33.2. 

The difference between the sum and the ratio B-C becomes greater 
as B and C are farther apart in the table. 

11. But if B and C are so chosen that the ratios A-B and A-C lie on 
the same side of the diagonal, then the difference between A-B and A-C is 
approximately, but a little less than, the ratio B-C.

Thus, if A is vermilion, B is white and C is club, the ratio A-B is 30.5; 
ratio A-C is 18.8; their difference 11.7 is a little less than the ratio B-C, 
which is 14.3. 

To these relations, evident from the table, must be added certain 
others that have been established by experiment: 

12. When there occurs a crossover between any two genes of the 
series (shown in the upper row of table 2), there occurs also as a rule a 
crossover between any gene lying to the right of these two and any other 
lying to the left of these two. 

Thus, if there occurs a crossover between vermilion and sable, as 
a rule there occurs also a crossover between club and sable, club and 
rudimentary, club and bar, white and sable, white and bar, etc., etc. 

13. But in such a case there is as a rule no crossover between two genes 
in the series on the same side of the two genes that show a crossover. 
Thus, if there is a crossover between vermilion and sable, there is
as a rule no crossover between any of the genes yellow-white-bifid-club- 
vermilion; these all remain in the same grouping as before. 

14. That is, such series as are represented in the rows or columns of 
table 2 usually in crossing over separate into two groups at a certain 
place in the series, all the members of one group showing a crossover 
with all the members of the other. 

15. But in relatively rare cases, such a series separates at two points 
in such a way that all the genes between the two points show crossovers 
with all the genes outside the two points. Still more rarely, the series 
separates at three points. 

16. In a considerable proportion of the cases (30 to 50 percent), no 
crossing over occurs between any of the genes of the chromosome. (Thus, 
in the X chromosome of Drosophila, MULLER found that there is no 
crossing over between any of the genes from yellow to bar, in 54.4 percent 
of the cases.) 

17. When a crossover occurs between two genes, A and B, having 
together a low ratio, no crossover occurs between one of these genes 
(as A), and any other gene C having a low ratio with A and lying in the 
series on the opposite side of A from B. Or: such a series as that of 
table 2 does not break at two points that are close together
(“interference”).
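
The numerical side of propositions 5, 10 and 11 can be checked mechanically. The sketch below is only an illustration: it uses the ratios quoted in the examples above, and the names `EXAMPLES` and `within_bounds` are hypothetical. It tests whether the ratio A-C lies strictly between the difference and the sum of the ratios A-B and B-C.

```python
# Each entry: (A, B, C, ratio A-B, ratio B-C, ratio A-C), in percent,
# taken from the worked examples under propositions 5, 10 and 11.
EXAMPLES = [
    ("yellow", "club", "vermilion", 17.7, 18.8, 34.5),
    ("yellow", "bifid", "miniature", 5.5, 30.6, 34.3),
    ("white", "vermilion", "club", 30.5, 18.8, 14.3),
    ("white", "vermilion", "miniature", 30.5, 3.1, 33.2),
]


def within_bounds(ab, bc, ac):
    """True when A-C lies strictly between |A-B - B-C| and A-B + B-C."""
    return abs(ab - bc) < ac < ab + bc


for a, b, c, ab, bc, ac in EXAMPLES:
    low, high = abs(ab - bc), ab + bc
    print(f"{a}-{c}: {ac} between {low:.1f} and {high:.1f}: "
          f"{within_bounds(ab, bc, ac)}")
```

The comparison is strict on purpose: a ratio exactly equal to the sum or to the difference would fail the check, in keeping with the "a little less" and "a little more" of the propositions.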

These relations are shown not alone for the 9 genes given in our table. 
If we take the other genes of the X chromosome, they fall, as far as the 
data on them goes, into this same system. Or if we select the most 
thoroughly studied genes of the second chromosome of Drosophila, we 
find that they yield a table similar to table 2, and the propositions set 
forth above apply equally to them. 

It is important to observe that all these relations (table 2, and propo- 
sitions 1 to 17) are purely experimental results of breeding; they do 
not depend in the least upon any theory of what crossing over is or of the 
arrangement of genes. So far as these relations have come to the con- 
sciousness of investigators not engaged in work on Drosophila, it appears 
to be the impression that they are simply postulates of some particular 
theory. On the contrary they are statements of observed fact. 

It is of course true that these relations hold only approximately, so 
far as precise figures are concerned. But when one considers the extremely 
long and complex process of experimentation required for obtaining each 
of the ratios; the great number of extraneous factors coming in to disturb 
the precision of the results, and the number of conditions known to 
modify the crossover ratios, it must appear marvelous that the system
reveals itself so unmistakably as it does in tables arranged as in table 2. 
To come out so clearly from beneath such a load of extraneous conditions 
the system must be of some extremely well-defined character. 

It is clear then that there is a system underlying the ratios of crossing
over; es deutet das Chor auf ein geheimes Gesetz (the chorus points to a
hidden law). What is the law that
yields this remarkable system? It is idle to propose theories of crossing 
over that ignore or conflict with this system; it is for these relations that 
a theory is called upon to account. In view of the numerous, complex 
and extraordinary relations shown (set forth in propositions 1 to 17), 
it would appear improbable that any great number of diverse theories 
could equally well explain the results; apparently a correct theory should 
show itself unmistakably. 


THE “LINEAR” THEORY OF CROSSING OVER 


The so-called “linear” theory holds that the system set forth above 
is due to the arrangement of the material genes in a serial order; that the 
genes either are the particles seen under the microscope arranged in 
serial order in the chromosomes at the stages when the latter pair in the 
formation of the germ cells, or that at least the genes have the same 
arrangement as these particles. Crossing over, it is held, is the breaking 
and interchange of parts of the two chromosomes while they are paired 
side by side. Naturally, it is set forth, genes that are far apart become 
separated by breaks more frequently than do those close together; and 
it is to this that the system described above is due. There results a series 
of diverse ratios, from high to low, as found in experimental breeding,— 
these corresponding to genes varying in distance, from contiguity to wide 
separation. 

To judge of this theory, we require an examination of the relations 
that must hold if the genes were so arranged, for comparison with the 
relations found in the breeding experiments. Would there result from a 
serial arrangement the relations illustrated by table 2, and formulated in 
propositions 1 to 17 above? Further, would there result the particular 
numerical values actually recorded in the experimental studies? Under 
this second question we have specifically to inquire: 

(1) Whether there would result the crossover ratios found in nature,— 
limited in the way that they are; 

(2) Whether the distances between genes measured in the standard 
units of length (each such as to give 1 percent of crossing over), shown 
in the current diagrams of the chromosomes (e.g., BRIDGES 1921) would
result; 

(3) Whether the numbers of breaks in the chromosome, that observa- 
tion, interpreted on the linear theory, shows to occur, would result from 
the presuppositions of that theory. 


OBVIOUS COINCIDENCES AND DISCREPANCIES BETWEEN THE RELATIONS 
REQUIRED BY THE LINEAR THEORY, AND THOSE 
SHOWN BY THE RATIOS 


On the linear theory the genes in a chromosome are comparable to a 
string of beads. Two such strings (in the known pairing of the chromo- 
somes) lie side by side and intertwine or partly fuse; they break at times 
and change partners; a portion of each string reunites with a portion of 
the other to form a new string. If this were true, what would be the rules 
and ratios of crossing over? 

The obvious strong point of the linear theory is that it accounts directly 
for the facts set forth in our propositions 12 to 15,—according to which 
crossing over occurs between continuous groups of genes forming the 
series of table 2. If before crossing over the genes of the two chromosomes 
form the two series 

    A B C D E F G H
    a b c d e f g h
then after a break occurring between the genes C and D, with resulting
change of partners, we have
    A B C d e f g h
    a b c D E F G H

And by the occurrence of two or more breaks we obtain such results as
    [diagram: paired series after two or more breaks; not recoverable from the source]
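
The exchange of partners pictured in the diagrams above can be sketched as follows. This is a minimal illustration, not part of the original treatment; the function name `exchange` and the convention of numbering break positions from 0 are assumptions made for the example.

```python
def exchange(upper, lower, breaks):
    """Return the two strands after partners are exchanged at each break.

    `breaks` holds 0-based positions; a break at position k lies between
    gene k-1 and gene k (so k = 3 falls between C and D).
    """
    new_upper, new_lower = [], []
    swapped = False
    for k, (u, l) in enumerate(zip(upper, lower)):
        if k in breaks:
            swapped = not swapped  # each break switches the strand followed
        new_upper.append(l if swapped else u)
        new_lower.append(u if swapped else l)
    return "".join(new_upper), "".join(new_lower)


print(exchange("ABCDEFGH", "abcdefgh", {3}))     # ('ABCdefgh', 'abcDEFGH')
print(exchange("ABCDEFGH", "abcdefgh", {2, 5}))  # ('ABcdeFGH', 'abCDEfgh')
```

With two breaks the genes outside the exchanged piece stay together, which is the behavior described in propositions 14 and 15.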
But does the linear theory account for the remaining relations? If we 
hold the genes A to H to be separated by equal distances, say ten units 
each, and if we then tabulate their distances from each other, beginning 
with A, we obtain a table having many of the properties shown in table 2
for the crossover ratios. Such a table of distances is given in table 3.

Examining table 3, we find that it shows most, but not all, of the proper- 
ties set forth in our propositions 1 to 17, above. In detail, the identities 
and differences are as follows. The table of distances shows, if we sub- 
stitute “distances” for “ratios,” precisely the relations shown in propo- 
sitions 2, 3, 4, 6, 7, 8, 9; and the assumption underlying it accounts, as 
before remarked, for propositions 12, 13, 14 and 15. But with regard to
propositions 5, 10 and 11, while the relations are analogous in the two
cases, there is a systematic difference. In the table of distances, the rela- 
tions formulated in propositions 5, 10 and 11 are exact; instead of saying 
in proposition 5 that the ratio of A to C is “a little less than the sum or a 
little more than the difference” of ratios A-B and B-C, we have to say for 
the distances that the distance A-C is precisely the sum or precisely the 
difference of the distance A-B and B-C (where A, B and C may represent 
any of the letters A to H). Similarly, in proposition 10, the sum of
distances A-B and A-C gives, not more than B-C, but just B-C; and in
proposition 11, the difference between A-B and A-C is not less than, but
exactly B-C. This systematic “more than” or “less than” in the table of ratios
requires accounting for. Further we find in the table of distances nothing 
related to or accounting for the facts set forth in propositions 1, 16 and 17. 


TABLE 3


Table of distances between nine serial points lying ten units apart on a line. For comparison with
the relations shown in the table of crossover ratios.


   |  A |  B |  C |  D |  E |  F |  G |  H |  I
A  | .. | 10 | 20 | 30 | 40 | 50 | 60 | 70 | 80
B  | 10 | .. | 10 | 20 | 30 | 40 | 50 | 60 | 70
C  | 20 | 10 | .. | 10 | 20 | 30 | 40 | 50 | 60
D  | 30 | 20 | 10 | .. | 10 | 20 | 30 | 40 | 50
E  | 40 | 30 | 20 | 10 | .. | 10 | 20 | 30 | 40
F  | 50 | 40 | 30 | 20 | 10 | .. | 10 | 20 | 30
G  | 60 | 50 | 40 | 30 | 20 | 10 | .. | 10 | 20
H  | 70 | 60 | 50 | 40 | 30 | 20 | 10 | .. | 10
I  | 80 | 70 | 60 | 50 | 40 | 30 | 20 | 10 | ..
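
A table of this kind is easily generated, and the exactness of the sum-or-difference relation for distances is easily confirmed. The following sketch assumes nothing beyond nine points spaced ten units apart; the names `POINTS`, `SPACING`, `distance` and `exact_sum_or_difference` are hypothetical.

```python
from itertools import permutations

POINTS = "ABCDEFGHI"   # nine serial points
SPACING = 10           # units between neighbouring points


def distance(p, q):
    """Distance along the line between two labelled points."""
    return SPACING * abs(POINTS.index(p) - POINTS.index(q))


def exact_sum_or_difference():
    """Check that A-C is exactly the sum or the difference of A-B and B-C."""
    for a, b, c in permutations(POINTS, 3):
        ab, bc, ac = distance(a, b), distance(b, c), distance(a, c)
        if ac != ab + bc and ac != abs(ab - bc):
            return False
    return True


for p in POINTS:
    print(p, " ".join(f"{distance(p, q):2d}" for q in POINTS))
print(exact_sum_or_difference())
```

For distances the relation holds exactly for every triple of points, whereas the observed ratios fall a little short of the sum or a little beyond the difference.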





It is of course a striking fact that such a table of linear distances shows 
a considerable number of the curious relations shown by the crossover 
ratios; this gives the linear theory a certain prima facie probability. 
Strictly, however, these coincidences do not directly support the linear 
theory, for we are dealing in the one case with ratios, in the other with 
distances. What we require is to know whether the ratios themselves 
would give such a system, if they are brought about in the way the linear 
theory sets forth. We require to know whether they would show the 
same coincidences with the system found in the genes as are shown by 
the table of linear distances. And further, would they manifest the same 
discrepancies with the relations shown by the distances, that we have just 
noted in connection with propositions 5, 10 and 11? Can the linear theory 
account for the “less than” and “‘more than” of these propositions? Does 
it account for the relations set forth in propositions 1, 16 and 17, which
are not shown by the table of distances? If the linear theory is correct it 
must yield and indeed require all the relations set forth in our 17 proposi- 
tions. 

More than this, the linear theory, if correct, must be consistent with the 
actual values found for the crossover ratios, in their relations one to an- 
other, and with the frequency of occurrence of single, double, and triple 
breaks, and of no-breaks, in the series given in table 1. 

Can it fulfil all these requirements? 


RELATIONS REQUIRED BY THE LINEAR THEORY 


We now proceed to an examination of the assumptions and require- 
ments of the linear theory. 

If, as it sets forth, the serial strings of genes break and cross over at 
times, either the breaks must occur as frequently between any two con- 
tiguous genes as between any other two, or they occur more frequently 
at certain points than at others. Further, conditions might be such that 
(as is commonly held to be the case), the occurrence of a break at a given 
joint would interfere with the occurrence of another near by. As a basis 
for the treatment, it is necessary to examine first the generalized case in 
which breaks are equally frequent at all joints, and there is no inter- 
ference; then the modifications required if these conditions do not hold 
may be set forth. 


THE RULES FOR CROSSING OVER WHEN THE OCCURRENCE OF ONE BREAK 
DOES NOT INFLUENCE THE OCCURRENCE OF OTHERS (NO INTERFERENCE) 


On the linear theory, in a particular chromosome, a crossover will have 
occurred between any two genes, A and M (close together or far apart), 
when a single break has occurred between them; for then the two that 
were formerly in the same chromosome will be in diverse chromosomes. 
But, as shown by the diagrams on page 401, if just two breaks have oc- 
curred between the two genes, they still remain in the same chromosome 
(though a piece between them has been exchanged), so that in experimen- 
tal breeding no crossover would be found to have occurred so far as A 
and M were concerned. And in general, consideration shows that, on 
this theory, whenever the number of breaks between two genes, A and M, 
is odd, they will have passed into different chromosomes, so that a cross- 
over has occurred for the pair A-M; while whenever there is an even num- 
ber of breaks between them, or no break whatever, A and M are still in 
the same chromosome, so that no crossover has occurred for the pair 
A-M. (There may have been crossing over for pairs between these, but
this will not be discovered in breeding experiments limited to observation 
of the genes A and M). 

In other words, on the linear theory crossing over occurs for any particu- 
lar pair of genes when there is an odd number of breaks between them, 
and not otherwise, and the law of the frequency of crossing over for them 
is the law of the frequency of the occurrence of odd numbers of breaks 
between them. So far as the particular pair is concerned, the effect is the 
same whether the odd number is 1 or 101 or any other. Our problem there- 
fore resolves itself into that of discovering the frequency of odd numbers 
of breaks between two genes; it is the same problem as discovering the 
frequency of odd numbers of breaks between two given beads of a string, 
when it is pulled, if all breaks occur simultaneously and are just as likely 
to occur in any one place as in any other. 
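
Before any rule is derived, the frequency of odd numbers of breaks can be found by direct enumeration. The sketch below is an illustration under the stated assumptions (every joint breaks independently with the same frequency b); the function name `odd_break_frequency` is hypothetical. It lists every pattern of broken and unbroken joints and adds up the probabilities of those with an odd number of breaks.

```python
from itertools import product


def odd_break_frequency(b, n):
    """Probability of an odd number of breaks among n independent joints.

    b is the chance that any one joint breaks; all 2**n patterns of
    broken (1) and unbroken (0) joints are enumerated.
    """
    total = 0.0
    for pattern in product((0, 1), repeat=n):
        if sum(pattern) % 2 == 1:          # odd number of breaks: a crossover
            p = 1.0
            for broken in pattern:
                p *= b if broken else (1.0 - b)
            total += p
    return total


for n in (1, 2, 3, 10):
    print(n, round(odd_break_frequency(0.1, n), 4))
```

Enumeration becomes slow for many joints, since the number of patterns doubles with each added joint; it serves here only as an independent check.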

It is a simple matter to discover the rule for this. Let us suppose that
a break occurs between any two adjacent beads in a certain proportion, b,
of the cases; say in $\tfrac{1}{10}$ of the cases. Then the proportion of cases
in which a break does not occur (which we may call a) will be $1-b$; here
$\tfrac{9}{10}$ of the cases. Consider any single adjacent pair: there is
possibility of but 1 break between them; for convenience we may say there is
but one joint between them. This joint will then break in $\tfrac{1}{10}$ of
the cases, not in $\tfrac{9}{10}$. We can tabulate as follows:

Breaks        0          1

Proportion    a (= .9)   b (= .1)
Now, if we take three consecutive genes or beads, A-B-C, there are two
joints between them. At the first joint there will be 0 breaks in
$\tfrac{9}{10}$ of the cases; at the second, 0 breaks in $\tfrac{9}{10}$; so
there will be 0 breaks at both in $\tfrac{9}{10} \times \tfrac{9}{10}$ of the
cases. There will be 0 breaks at the first, 1 break at the second, in
$\tfrac{9}{10} \times \tfrac{1}{10}$ of the cases; 1 break at the first, 0 at
the second, in $\tfrac{1}{10} \times \tfrac{9}{10}$; 1 break at the first, 1 at
the second, in $\tfrac{1}{10} \times \tfrac{1}{10}$ of the cases. Thus there
will be a total of 0 breaks in $(\tfrac{9}{10})^2$ (or $a^2$) of the cases; a
total of 1 break in twice $(\tfrac{9}{10} \times \tfrac{1}{10})$ (or in $2ab$)
of the cases; a total of 2 breaks in $(\tfrac{1}{10})^2$ (or $b^2$) of the
cases. Tabulating, we have

Breaks        0       1       2

Proportion    $a^2 + 2ab + b^2 = (a+b)^2$

If we apply this reasoning to the case of three joints, we find the
proportions for the different possible number of breaks, beginning with 0,
to be $a^3 + 3a^2b + 3ab^2 + b^3$; and in general for any number, n, of joints
we find the proportions of the diverse numbers of breaks from 0 to n to be
given by the expansion of the binomial expression $(a+b)^n$.
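
The terms of the binomial expansion can be written out numerically. The sketch below is only an illustration; the function name `break_count_proportions` is hypothetical, and `math.comb` supplies the binomial coefficients.

```python
from math import comb


def break_count_proportions(b, n):
    """Proportions of cases with 0, 1, ..., n breaks among n joints.

    These are the successive terms of the expansion of (a + b)**n,
    with a = 1 - b the frequency of no break at a joint.
    """
    a = 1.0 - b
    return [comb(n, k) * a ** (n - k) * b ** k for k in range(n + 1)]


terms = break_count_proportions(0.1, 2)
print([round(t, 4) for t in terms])   # [0.81, 0.18, 0.01]
print(round(sum(terms), 12))          # 1.0
```

Since a + b = 1, the terms always sum to 1, whatever the number of joints.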








ARRANGEMENT OF THE GENES IN A LINEAR SERIES 405 


Between two genes separated by n joints, the proportion of crossovers
is, as we have seen, given by the proportion of odd numbers of breaks;
and these odd numbers are obviously given by the alternate terms of the
expansion of $(a+b)^n$, beginning with the second; that is, those terms
containing odd powers of b. If we can find the sum of these alternate terms,
we shall obtain the total proportion of crossovers.

This can be done as follows. Take as an example the case of four break- 
ing joints: 

Number of breaks    0        1           2            3         4
Proportions         $a^4 \;+\; 4a^3b \;+\; 6a^2b^2 \;+\; 4ab^3 \;+\; b^4$

This expression gives the sum of the proportions of the crossovers and 
the non-crossovers; and since a+b=1, any power of a+b is likewise
equal to 1; so that the sum of the proportions of crossovers and non- 
crossovers is 1. If now we can likewise get the difference between the 
proportion of crossovers and that of non-crossovers, we can of course find 
the value of each. The difference between the two is obtained by sub- 
tracting the terms representing the crossovers,—those with odd powers of 
b,—from the sum of the others; that is, by giving the minus sign to the 
alternate terms containing odd powers of b. If we let N represent the 
proportion of non-crossovers, C the proportion of crossovers, this then 
gives, in our example: 

$$N - C = a^4 - 4a^3b + 6a^2b^2 - 4ab^3 + b^4$$

But this is the expression for $(a-b)^4$, and for any value of n, it is the
expression for $(a-b)^n$, so that $N - C = (a-b)^n$.

Now, a is equal to $1-b$, as we have seen, so that $a-b = 1-2b$, and we
can substitute this for $a-b$ in the above expression. We have therefore
obtained the two equations:

$$N + C = 1$$

$$N - C = (1-2b)^n$$

Subtracting the latter from the former and dividing through by 2, we
obtain:

$$C = \tfrac{1}{2} - \tfrac{1}{2}(1-2b)^n \qquad (1)$$

Where C = the proportion or ratio of crossovers
      b = the frequency of a break at any joint
      n = the number of joints separating the two genes, for
          which the crossover ratio is sought.
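
Formula (1), read as C = 1/2 - 1/2 (1 - 2b)^n, can be compared directly with the sum of the odd-power terms of the binomial expansion from which it was derived. The sketch below is an illustration only; both function names are hypothetical.

```python
from math import comb


def crossover_ratio(b, n):
    """Formula (1): crossover ratio for n joints, each breaking with frequency b."""
    return 0.5 - 0.5 * (1.0 - 2.0 * b) ** n


def odd_terms_sum(b, n):
    """Sum of the terms of (a + b)**n that contain odd powers of b."""
    a = 1.0 - b
    return sum(comb(n, k) * a ** (n - k) * b ** k for k in range(1, n + 1, 2))


for b, n in [(0.1, 1), (0.1, 2), (0.1, 4), (0.05, 10)]:
    print(b, n, round(crossover_ratio(b, n), 6), round(odd_terms_sum(b, n), 6))
```

The two columns agree to within rounding error for every pair tried, as the derivation requires.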

Formula (1) gives at once the value of the crossover ratio C, in case 
there is no interference, and a break occurs as frequently at any joint as 
at any other. By its use we can compute at once the value of the
crossover ratio for genes separated by any given number of joints and with
any given proportion b of breaks at each joint. Thus, if there is a break
between contiguous genes in one case out of 20, and two genes are sepa- 
rated by 10 joints, we have by (1): 

C=4-4 (4%) =35.57 percent 
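
Formula (1) can be checked numerically. The following is a minimal sketch in Python, not part of the original exposition; the function names are illustrative assumptions. It evaluates (1) directly and, as a cross-check, sums those terms of the binomial $(a+b)^n$ that carry odd powers of $b$, which by the argument above represent the crossovers.

```python
from math import comb


def crossover_ratio(b: float, n: int) -> float:
    """Formula (1): C = 1/2 - 1/2 (1 - 2b)^n, assuming no interference
    and the same break frequency b at each of the n joints."""
    return 0.5 - 0.5 * (1.0 - 2.0 * b) ** n


def crossover_ratio_from_binomial(b: float, n: int) -> float:
    """Sum the terms of (a + b)^n having odd powers of b (odd numbers of breaks)."""
    a = 1.0 - b
    return sum(comb(n, d) * a ** (n - d) * b ** d for d in range(1, n + 1, 2))


if __name__ == "__main__":
    # The case used in the text: a break in one case out of 20, and 10 joints.
    direct = crossover_ratio(1 / 20, 10)
    summed = crossover_ratio_from_binomial(1 / 20, 10)
    print(f"formula (1): {100 * direct:.2f} percent; odd terms: {100 * summed:.2f} percent")
```

The two routes agree because the odd terms are exactly half the difference between $(a+b)^n$ and $(a-b)^n$.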

This formula (1) shows at once certain important relations, and others 
are readily developed from it. It will be necessary to set forth these 
relations both for their own sake, and because these form the key to our 
later development of the relations in crossing over with interference. 

1. Formula (1) shows at once that if there were no interference, our 
first basic proposition must necessarily hold; that is, the crossover ratio 
C would vary from a very low value up to close to 50 percent as a limit, 
but would never reach or exceed 50 percent. For if a break occurs at a 
given joint less frequently than no-break, $b$ is less than $\tfrac{1}{2}$, so that $1-2b$,
and hence $(1-2b)^n$, is always a positive quantity; $\tfrac{1}{2}(1-2b)^n$ then always
has a positive value. But by (1), $C = \tfrac{1}{2}$ minus this positive value; it
is therefore always less than $\tfrac{1}{2}$, i.e., less than 50 percent.

But (1—2b) is of course always less than 1; it is a proper fraction; and 
as higher powers are taken of a proper fraction, its value becomes less; 
by taking a sufficiently high power, its value may be made as near 0 as 
one may desire. Hence with high values of $n$ (large numbers of joints),
the expression $\tfrac{1}{2}(1-2b)^n$ approaches 0 in value, so that $C$, or $\tfrac{1}{2} - \tfrac{1}{2}(1-2b)^n$,
approaches $\tfrac{1}{2}$, or 50 percent, in value. Hence, the crossover ratio could
approach 50 percent so closely as to differ from it by less than any given 
value, but could never reach it. 

2. It is obvious that to account for the facts in Drosophila on the linear
theory, whether there is or is not interference, $b$ must be less than $a$, and
thus less than $\tfrac{1}{2}$; in other words, that at a given joint a break is less
frequent than no-break. For otherwise every chromosome would show many
breaks,—many crossovers; it would not be true, as set forth in proposition 
16, that a large proportion of the chromosomes show no crossover between 
any of the genes. 

3. For later comparison with conditions resulting from interference, 
we must bring out a further consequence that would follow if, when there 
is no interference, $b$ were equal to or greater than $a$. If in formula (1), $b$
is given the value $\tfrac{1}{2}$, then whatever the number of joints $n$, $C$ will obviously
be $\tfrac{1}{2}$; that is, the crossover ratio would be 50 percent for all pairs
of genes. If $b$ is given a value above $\tfrac{1}{2}$, then the expression $(1-2b)$ becomes
negative in value, and as such a negative quantity is raised to
successively higher powers, it is of course alternately negative and
positive in value. It will therefore be found that the crossover value
$C$ will be alternately above and below $\tfrac{1}{2}$, but gradually approaching $\tfrac{1}{2}$
as $n$ becomes higher. For all genes with odd numbers of joints between
them ($n$ = 1, 3, 5, etc.), $C$ will be above $\tfrac{1}{2}$; for all genes separated by even
numbers of joints, $C$ will be below $\tfrac{1}{2}$. Thus, if the frequency of a break
at any given joint were .75, the value of $C$ would be successively, for genes
separated by 1, 2, 3, 4, ... joints, 75, 37.5, 56.25, 46.875 percent, and
so on.

Thus, if a break occurred more frequently at a given joint than no- 
break, the order of the genes would not be given by taking the ratios of 
the gene at the end with other genes in either ascending or descending 
order. Suppose that a break occurred at a given joint in .75 of the cases, 
then if there were five genes in the order A B C D E, the ascending order, 
used in formulating the work on Drosophila, would yield the series 
A C E D B, while the descending order would yield A B D E C,—both 
quite diverse from the actual series. But, as we have seen, in Drosophila 
b cannot be greater than a, so that this situation is not met. 
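
The disturbance of the apparent order when a break is more frequent than no-break can be illustrated by a short computation. This is a sketch only, under the assumptions of formula (1) (no interference, equal $b$ at every joint); the function names and the single-letter gene labels are illustrative.

```python
def crossover_ratio(b: float, n: int) -> float:
    """Formula (1): C = 1/2 - 1/2 (1 - 2b)^n."""
    return 0.5 - 0.5 * (1.0 - 2.0 * b) ** n


def apparent_orders(b: float, genes: str = "ABCDE"):
    """Rank the genes by their crossover ratio with the first (end) gene.

    Adjacent genes are assumed to be separated by exactly one joint.
    Returns the ratios and the series obtained in ascending and descending order.
    """
    first, rest = genes[0], genes[1:]
    ratio = {g: crossover_ratio(b, i + 1) for i, g in enumerate(rest)}
    ascending = first + "".join(sorted(rest, key=ratio.get))
    descending = first + "".join(sorted(rest, key=ratio.get, reverse=True))
    return ratio, ascending, descending


if __name__ == "__main__":
    ratio, ascending, descending = apparent_orders(0.75)
    print({g: 100 * c for g, c in ratio.items()})  # ratios with gene A, in percent
    print(ascending, descending)
```

With any value of $b$ below one-half the ratios rise steadily with distance, and the ascending series reproduces the true order.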

From the formula (1) for C we may readily derive a formula showing 
directly what number of joints (and therefore of genes) is required to 
obtain any crossover ratio, for any given value of b (any given frequency 
of breaks at each joint) provided there is no interference. When there is 
but one joint, $b$ is equal to $C$; that is, the crossover ratio is the frequency
of a break at that joint. Now, for any two diverse values of $b$, we require
of course diverse values of $n$ (number of joints) to give the same crossover
ratio. Let these be $n_1$ and $n_2$. Then obviously for the two cases:

$$C = \tfrac{1}{2} - \tfrac{1}{2}(1-2b_1)^{n_1}$$
and
$$C = \tfrac{1}{2} - \tfrac{1}{2}(1-2b_2)^{n_2}$$
which yield, by transposition of terms:
$$(1-2b_1)^{n_1} = 1-2C$$
$$(1-2b_2)^{n_2} = 1-2C$$
And since $C$ is the same in both cases
$$(1-2b_2)^{n_2} = (1-2b_1)^{n_1}$$
But if diverse powers of two different quantities are equal, it is a well-known
relation that the two exponents are inversely proportional to the
logarithms of the two quantities. Hence in this case,
$$\frac{n_2}{n_1} = \frac{\log(1-2b_1)}{\log(1-2b_2)} \tag{2}$$

Taking now the case that $n_1 = 1$; as remarked above, $b_1$ is now $C$; substituting
these two values in the above equation (and dropping the subscript
of $b_2$, since it is now unnecessary) we obtain:
$$n = \frac{\log(1-2C)}{\log(1-2b)} \tag{3}$$

Equation (3) gives directly, for the case of no interference, the number
$n$ of joints required to give any crossover ratio $C$, for any frequency $b$ of
breaks at each joint.² Thus: How many joints are required to give 20
percent crossing over if there is no interference and the frequency of a
break at any joint is $\tfrac{1}{2}$ percent? Here $C$ = .2 and $b$ = .005. Hence:

$$n = \frac{\log(1-.4)}{\log(1-.01)} = \frac{\log .6}{\log .99} = \frac{-.2218487}{-.0043648} = 50.83$$
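
Equation (3) is readily evaluated without tables. The following Python sketch is illustrative only (the function name is an assumption); it also confirms that the number of joints so found, when put back into formula (1), returns the crossover ratio from which we started.

```python
from math import log10


def joints_required(C: float, b: float) -> float:
    """Equation (3): n = log(1 - 2C) / log(1 - 2b), for no interference.

    The base of the logarithms cancels; common logarithms are used to match the text.
    """
    return log10(1.0 - 2.0 * C) / log10(1.0 - 2.0 * b)


if __name__ == "__main__":
    n = joints_required(0.20, 0.005)
    print(f"joints required: {n:.2f}")
    # Round trip through formula (1), allowing a fractional n.
    print(f"ratio recovered: {0.5 - 0.5 * (1.0 - 2.0 * 0.005) ** n:.4f}")
```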
In the chromosomes we of course know neither the value of b (frequency 
of a break at a given joint), nor the number of joints, $n$, that yield a given
crossover ratio $C$. For determining general relations, therefore, we must
obtain formulae that are independent of the particular values of $b$ and $n$.
In experimental breeding the workers who employ the linear theory have
attempted to do this by taking as the unit of measurement that length
of the chromosome that yields 1 percent of crossovers; this length is then
laid off as a foot-rule on the chromosome (by processes of breeding that
we need not go into here), the distances between specific genes, and the
distances required to give particular crossover ratios, being thus determined
in terms of this unit. This permits comparison of the distances
required to give diverse values of the crossover ratio. It is found thus,
for example, that genes 30 units apart do not yield a crossover ratio three
times that of two genes 10 units apart, but definitely less; and similarly of
other cases. To determine these relations for the ideal linear theory without
interference, independently of the values of $b$ and $n$, the following
must be taken into consideration. In formula (1), $n$ is the distance
(number of joints) from one of the two genes to the other. With any given
value of $b$, two different crossover ratios, $C_1$ and $C_2$, are produced, of course,
by different powers (values of $n$), as $n_1$ and $n_2$. Hence we have:
$$C_1 = \tfrac{1}{2} - \tfrac{1}{2}(1-2b)^{n_1}$$
$$C_2 = \tfrac{1}{2} - \tfrac{1}{2}(1-2b)^{n_2}$$

By transposition of terms, these equations yield
$$(1-2b)^{n_1} = 1-2C_1$$
$$(1-2b)^{n_2} = 1-2C_2$$

² It is to be noticed in using logarithms of decimal fractions for such purposes as are employed
in this paper, and particularly in cases where the exponents are fractional, that the
logarithms of decimals smaller than unity are minus quantities, not the plus quantities
(mantissas) given in the usual tables of logarithms. They are obtainable from the logarithms
taken from the tables by subtracting the latter from 10 and giving the result the minus sign.
Thus for .2 the logarithm obtained from the table is 9.301030; subtracting from 10 and using
the minus sign, this yields -.6989700; for .002 the logarithm is -2.6989700, and so on.





Now, it is a well known general relation that when we have two diverse
powers of a single quantity (in this case $1-2b$), the two exponents are
proportional to the logarithms of the resulting powers: so that in this case

$$\frac{n_2}{n_1} = \frac{\log(1-2C_2)}{\log(1-2C_1)} \tag{4}$$

By this equation we can therefore readily find for the case of no interference
the proportional distances required to produce, with any given
value of $b$, any two crossover ratios. If it be asked: How many times as
great a length is required to produce 40 percent crossovers as to produce
20 percent, we have

$$\frac{n_2}{n_1} = \frac{\log(1-.80)}{\log(1-.40)} = \frac{\log .2}{\log .6} = \frac{-.6989700}{-.2218487} = 3.1507$$

That is, the distance between two genes that is required to yield 40 
percent crossing over would be, if there were no interference, 3.1507 
times that required to give 20 percent. 
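
That the proportional distance given by equation (4) does not depend on the particular value of $b$ may be verified by a brief computation. The sketch below is illustrative only; the function names are assumptions, and the two values of $b$ are chosen arbitrarily for the check.

```python
from math import log10


def distance_ratio(C2: float, C1: float) -> float:
    """Equation (4): n2 / n1 = log(1 - 2*C2) / log(1 - 2*C1), no interference."""
    return log10(1.0 - 2.0 * C2) / log10(1.0 - 2.0 * C1)


def joints_required(C: float, b: float) -> float:
    """Equation (3), used here only to confirm that b cancels out of the ratio."""
    return log10(1.0 - 2.0 * C) / log10(1.0 - 2.0 * b)


if __name__ == "__main__":
    print(f"{distance_ratio(0.40, 0.20):.4f}")
    for b in (0.005, 0.001):  # arbitrary break frequencies, for illustration
        print(b, joints_required(0.40, b) / joints_required(0.20, b))
```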

If now we wish to use the distance required to produce some particular 
crossover ratio (as 5 percent) as a unit of measurement, we have but to 
let $n_1 = 1$, and give to $C_1$ this particular value; we may then find for any
other crossover value $C_2$ the number of times this unit distance that is
demanded, by an equation derived thus directly from equation (4).
Letting "the number of times this unit distance" be represented by $r$,
and replacing in (4) the left side of the equation by this, we have

$$r = \frac{\log(1-2C_2)}{\log(1-2C_1)} \tag{5}$$

For comparison with the measurements given by the workers on Drosophila,
we have but to let $C_1$ in (5) be 1 percent or .01; then $1-2C_1 = .98$;
this will give us the number of units, each sufficient to give 1 percent
crossing over, that are required to produce the crossover ratio $C_2$. We
may replace the letter $r$ of formula (5) by the letter $U$, signifying units;
dropping the subscript of $C_2$, as now unnecessary, we have for the case of
no interference:

$$U = \frac{\log(1-2C)}{\log .98} = \frac{\log(1-2C)}{-.0087739} \tag{6}$$
For example, in case there were no interference, how many units of
distance are required to produce 49.9 percent crossing over? Here
$$U = \frac{\log .002}{-.0087739} = 307.617$$
Again, how many units of distance are required to produce $\tfrac{1}{2}$ percent
crossing over?
$$U = \frac{\log .99}{\log .98} = .497 \text{ (of one unit)}$$
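
Formula (6) may be evaluated in the same way. The following Python sketch is an illustration under the assumption of no interference; the function name is hypothetical.

```python
from math import log10


def units_required(C: float) -> float:
    """Formula (6): U = log(1 - 2C) / log .98, the distance in units
    each of which yields a crossover ratio of 1 percent."""
    return log10(1.0 - 2.0 * C) / log10(0.98)


if __name__ == "__main__":
    for C in (0.499, 0.005, 0.01):
        print(f"C = {100 * C:g} percent requires {units_required(C):.3f} units")
```

By construction a ratio of 1 percent requires exactly one unit; ratios near 50 percent require very many.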

Certain other numerical relations require elucidation: 

Suppose we have a certain crossover ratio $C_1$ for a certain length. What
will be, if there is no interference, the crossover ratio $C_2$ for a stretch $r$
times as great? Here we have but to arrange equation (5) so as to solve it
for the value of $C_2$. Equation (5) yields directly

$$\log(1-2C_2) = r \log(1-2C_1)$$
Now, if the logarithm of $(1-2C_2)$ is $r$ times that of $(1-2C_1)$, the quantity
$(1-2C_2)$ is of course the $r$th power of $(1-2C_1)$; that is,
$$1-2C_2 = (1-2C_1)^r$$
Whence, by transposition, etc.,
$$C_2 = \frac{1-(1-2C_1)^r}{2} \tag{7}$$
Thus, suppose a certain stretch on the chromosome gives 20 percent of
crossing over. What crossover ratio will be given by a stretch three times
as long?
Here
$$C_2 = \frac{1-(.6)^3}{2} = .392 = 39.2 \text{ percent}$$
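
Formula (7) lends itself to the same kind of check. The sketch below is illustrative; the function name is an assumption, and $r$ need not be a whole number.

```python
def ratio_for_multiple(C1: float, r: float) -> float:
    """Formula (7): C2 = (1 - (1 - 2*C1)^r) / 2, the crossover ratio for a
    stretch r times as long as one yielding C1 (no interference)."""
    return (1.0 - (1.0 - 2.0 * C1) ** r) / 2.0


if __name__ == "__main__":
    # A stretch yielding 20 percent, lengthened successively.
    for r in (1, 2, 3, 10):
        print(r, f"{100 * ratio_for_multiple(0.20, r):.1f} percent")
```

The ratio for the lengthened stretch always falls short of $r$ times the original ratio, and never reaches 50 percent.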


Suppose that we know the distance apart of two genes in our standard 
units (each yielding a 1-percent ratio). What is their crossover ratio? 
For this we have but to solve formula (6) for C; it may be transformed, on 
the principles employed above, into

$$C = \frac{1-.98^{U}}{2} \tag{8}$$
Thus, suppose $U$ = 20. Then
$$C = \frac{1-.98^{20}}{2} = 16.62 \text{ percent}$$
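
Formulae (6) and (8) are inverse to one another, which gives a convenient check on both. The following is a minimal Python sketch with hypothetical function names, assuming no interference.

```python
from math import log10


def ratio_from_units(U: float) -> float:
    """Formula (8): C = (1 - .98^U) / 2."""
    return (1.0 - 0.98 ** U) / 2.0


def units_required(C: float) -> float:
    """Formula (6): U = log(1 - 2C) / log .98."""
    return log10(1.0 - 2.0 * C) / log10(0.98)


if __name__ == "__main__":
    C = ratio_from_units(20)
    print(f"20 units yield {100 * C:.2f} percent")
    print(f"and that ratio requires {units_required(C):.2f} units")
```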


When there is no interference, what crossover ratio is given by the sum 
of several stretches, the crossover ratio for each of which is known? Call 
the partial ratios $C_1$, $C_2$, $C_3$, etc., and the ratio for the entire stretch, $C$.
We could find the value of the ratio $C$ for the total stretch by finding by
(6) the values of $U$ for each partial ratio, adding these values of $U$, and
determining by (8) the corresponding value of $C$. Performing these
operations algebraically, we find that they yield the formula

$$\log(1-2C) = \log(1-2C_1) + \log(1-2C_2) + \log(1-2C_3), \text{ etc.} \tag{9}$$
which is equivalent to

$$1-2C = (1-2C_1)(1-2C_2)(1-2C_3), \text{ etc.}$$
Whence
$$C = \frac{1-(1-2C_1)(1-2C_2)(1-2C_3)\cdots}{2} \tag{10}$$
EXAMPLE: Four crossover ratios for successive stretches are, in chromosome
I of Drosophila: 1.1, 5.3, 31.1, and 10.1 percent. What should be
the crossover ratio of the two genes at the extremes if there were no
interference?

$$C = \frac{1-(1-.022)(1-.106)(1-.622)(1-.202)}{2} = 36.8 \text{ percent}$$
For the common case of the crossover ratio to be expected from the sum
of two contiguous regions, the ratio for each of which is known, formula
(10) yields for the case of no interference the well known formula

$$C = C_1 + C_2 - 2C_1C_2 \tag{11}$$
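
The passage from the partial ratios to the ratio for the whole stretch can be followed in a few lines of Python. This is a sketch under the assumption of no interference; the function names are illustrative and do not come from the original.

```python
def combined_ratio(partial_ratios) -> float:
    """Formula (10): C = (1 - product of (1 - 2*C_i)) / 2 for successive stretches."""
    product = 1.0
    for c in partial_ratios:
        product *= 1.0 - 2.0 * c
    return (1.0 - product) / 2.0


def combined_ratio_two(c1: float, c2: float) -> float:
    """Formula (11), the two-region case: C = C1 + C2 - 2*C1*C2."""
    return c1 + c2 - 2.0 * c1 * c2


if __name__ == "__main__":
    # The four successive stretches of chromosome I given in the example.
    print(f"{100 * combined_ratio([0.011, 0.053, 0.311, 0.101]):.1f} percent")
    # Formula (11) agrees with formula (10) applied to two regions.
    print(combined_ratio([0.10, 0.20]), combined_ratio_two(0.10, 0.20))
```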

In formula (8) we have the crossover ratio determined for a given num- 
ber, U, of units, each of such a length as to yield a 1-percent crossover 
ratio. Obviously, it makes no difference in the validity of the formula 
whether these units are all of the same absolute length or not; whether all 
contain the same number of joints or not. If the frequency of a break is 
greater at certain joints than at others, then certain of these standard 
units will contain more joints than others; but the effect of all the units 
on the crossover ratio will be the same. That is, with formula (8) we have 
emancipated ourselves from the assumption with which we began, that 
breaks are equally frequent between any two adjacent genes. All formulae 
in which these standard units are employed are similarly independent of 
the relative frequency of breaks at different joints; the consequences 
flowing from them are valid, whether this frequency is or is not the same 
for different joints. This applies practically, in fact, as will be seen, to our 
entire exposition. 

Before proceeding by the use of the above formulae to a comparison of 
the relations required by the linear theory with those found in experimen- 
tation, one other matter requires formulation. This concerns the number 
of breaks to be expected in an entire chromosome, or between any two 
genes having a given crossover ratio. 










The number of breaks to be expected between any two genes, in crossing 
over without interference 


As before noted, on the linear theory the crossover ratio for any two 
genes is given by the frequency of odd numbers of breaks between them. 
The number of breaks might run up into the hundreds; if the number were 
odd, a crossover would result, exactly as if there were but one break.
But the conditions in inheritance do not admit of large numbers of
breaks. If we interpret on the linear theory the facts set forth in our prop- 
ositions 12 to 16, page 398, we are forced to assume that in a large 
proportion of the cases there is, for a particular chromosome, no break 
whatever (proposition 16, page 399) ; that for most other cases there is but 1 
break (propositions 12 to 14); that two breaks occur only very rarely, 
and that more than two occur with extreme rarity. 

The question arises whether these assumptions are consistent with the 
fundamental postulates of the linear theory. If the genes are in series and 
crossing over occurs by breaks in this series, would the numbers of breaks 
in the chromosomes be those which the breeding results, interpreted on 
the linear theory, require? 

We have seen on page 404 that the proportions of the cases showing
the different possible numbers of breaks from 0 to $n$ are given by the
successive terms of the binomial $(a+b)^n$. The exponent of $b$ in each term
indicates the number of breaks of which the total value of that term gives
the proportional number. For any given number of breaks $d$, the proportion
will be that shown by the value of the term containing $b^d$. The term
containing $b^d$ is of course
$$\frac{n!\,a^{n-d}\,b^d}{d!\,(n-d)!}$$
If we let $B_d$ represent the proportion
of cases in which just $d$ breaks occur between two genes separated by $n$
joints, then (if we substitute for $a$ its value $1-b$) we have:

$$B_d = \frac{n!\,(1-b)^{n-d}\,b^d}{d!\,(n-d)!} \tag{12}$$

For our later purposes, this and similar factorial expressions are better
written in such a way as to make the factors in the numerator and denominator
as few as possible. The expression $\frac{n!}{d!\,(n-d)!}$ is equivalent to an
expression in which the numerator consists of $d$ factors, beginning with $n$
and descending by differences of 1, while the denominator is factorial $d$.
Methods of writing in a formula "$d$ factors beginning with $n$ and descending
by differences of 1" vary, and all those that I have observed are
cumbersome. There appears to be no ambiguity involved in writing
this simply $n^{|d}$, and I shall therefore employ this method. Thus $23^{|4}$ =
23×22×21×20. Using this method, formula (12) becomes

$$B_d = \frac{n^{|d}\,(1-b)^{n-d}\,b^d}{d!} \tag{13}$$

(Where $d!$ = factorial $d$, and $n^{|d}$ = $d$ factors beginning with $n$ and descending
by differences of a single unit.)
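
Formula (13) may be put into computable form as follows. This is a sketch in Python with illustrative function names; the values of $n$ and $b$ used in the demonstration are assumed for the purpose and are not taken from any experiment.

```python
from math import comb, factorial


def descending_product(n: int, d: int) -> int:
    """d factors beginning with n and descending by differences of 1."""
    product = 1
    for k in range(d):
        product *= n - k
    return product


def breaks_proportion(n: int, b: float, d: int) -> float:
    """Formula (13): proportion of cases with just d breaks among n joints."""
    return descending_product(n, d) * (1.0 - b) ** (n - d) * b ** d / factorial(d)


if __name__ == "__main__":
    print(descending_product(23, 4) == 23 * 22 * 21 * 20)
    n, b = 69, 0.005  # assumed values, for illustration only
    print([round(breaks_proportion(n, b, d), 5) for d in range(4)])
    # Formula (13) is the ordinary binomial term of formula (12).
    print(abs(breaks_proportion(n, b, 2) - comb(n, 2) * (1 - b) ** (n - 2) * b ** 2) < 1e-15)
```

The proportions for all numbers of breaks from 0 to $n$ necessarily sum to 1, since they are the terms of $(a+b)^n$.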
This formula gives us the proportion of cases having any number $d$ of
breaks when the value of $n$ and of $a$ and $b$ are known. But in practice
these are not known, except that $b$ is less than .005. What we desire to
know is the proportion of chromosomes having each particular number of
breaks for a given crossover ratio. In other words, we wish to know the
value of $B_d$ in terms of $C$, the crossover ratio.

For this purpose, we may begin with $B_0$, the proportion of chromosomes
that have no break. This is, as we have seen, $(1-b)^n$.

Now, by formula (3), page 408, the number of joints $n$ is given by the
formula:

$$n = \frac{\log(1-2C)}{\log(1-2b)}$$

We can therefore substitute this for $n$ in the expression $B_0 = (1-b)^n$;
in other words the value of $B_0$ is given for any value of $C$ by raising $1-b$
to the power represented by $\frac{\log(1-2C)}{\log(1-2b)}$. To do this, we may of course
multiply the logarithm of $(1-b)$ by $\frac{\log(1-2C)}{\log(1-2b)}$, and this will give us the
logarithm of $B_0$. Performing this operation (and changing the order of the
factors), we obtain:

$$\log B_0 = \log(1-2C)\,\frac{\log(1-b)}{\log(1-2b)} \tag{14}$$
Now, as we have seen, $b$ is necessarily less than .005. It will be found
that for $b$ = .005, the value of the expression $\frac{\log(1-b)}{\log(1-2b)}$ is almost exactly $\tfrac{1}{2}$
(it is .49874), and that as $b$ is made smaller, this expression approaches $\tfrac{1}{2}$
as a limit. For all of the possible values of $C$ it may be therefore taken as
$\tfrac{1}{2}$, so that our equation becomes
$$\log B_0 = \tfrac{1}{2}\log(1-2C) \tag{15}$$
But of course if $\log B_0$ is $\tfrac{1}{2}$ of $\log(1-2C)$, it follows that $B_0$ is the square
root of $1-2C$. We obtain then the following
$$B_0 = \sqrt{1-2C} \tag{16}$$
That is, if there is no interference, for two genes showing a given
crossover ratio $C$, the proportion ($B_0$) of cases in which there is no break
between the genes is the square root of $(1-2C)$. Thus if the crossover
ratio is 10 percent ($C$ = .1), the proportion of cases in which there is no
break between the two genes is $\sqrt{.8}$, which is 89.44 percent. If $C$ = .40,
$B_0 = \sqrt{.2}$ = 44.72 percent.
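
The closeness of the approximation on which formula (16) rests can be examined directly. The Python sketch below is illustrative only, with hypothetical function names; it compares the value of $B_0$ from formula (14), taking $b$ = .005, with the limiting value from formula (16).

```python
from math import log, sqrt


def log_quotient(b: float) -> float:
    """The expression log(1 - b) / log(1 - 2b), which approaches 1/2 as b diminishes."""
    return log(1.0 - b) / log(1.0 - 2.0 * b)


def no_break_exact(C: float, b: float) -> float:
    """Formula (14): B0 = (1 - 2C) raised to the power log(1 - b) / log(1 - 2b)."""
    return (1.0 - 2.0 * C) ** log_quotient(b)


def no_break_limit(C: float) -> float:
    """Formula (16): B0 = square root of (1 - 2C)."""
    return sqrt(1.0 - 2.0 * C)


if __name__ == "__main__":
    print(f"{log_quotient(0.005):.5f}")
    for C in (0.10, 0.25, 0.40):
        print(C, f"{no_break_exact(C, 0.005):.5f}", f"{no_break_limit(C):.5f}")
```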
We have then the value of $B_0$ expressed entirely in terms of $C$, without
regard to the precise value of $n$ and $b$. To obtain corresponding expressions
for $B_1$, $B_2$, and in general for $B_d$, we may note the following: The value of
$B_0$ is $(1-b)^n$, and this is, therefore, as we have just seen, equivalent, in
the cases with which we are dealing, to $\sqrt{1-2C}$. We may therefore
substitute $\sqrt{1-2C}$ for $(1-b)^n$ in formula (13) for $B_d$. Noting that

$$(1-b)^{n-d} = \frac{(1-b)^n}{(1-b)^d}$$

we thus obtain:

$$B_d = \frac{n^{|d}\,b^d}{(1-b)^d}\cdot\frac{\sqrt{1-2C}}{d!} \tag{17}$$

In this formula, the expression $n^{|d}$ signifies $d$ factors, beginning with $n$
and descending by differences of 1. For our purposes, as we shall see, only
low values of $d$ have to be considered, from 0 up to 5, or thereabouts.
Furthermore, as before remarked, we know that the value of $b$ is less than
.005. We wish to determine whether, as $n$ is increased, there is a fixed or
limiting value for the proportion of cases having any particular number of
breaks, as 0, 1, 2, 3, etc.

Now, for high values of $n$, with low values of $b$ and $d$, we can substitute
$n^d$ for $n^{|d}$ in formula (17), without making any appreciable difference in
the result, as will be seen if $b$ is given the value of .005, $d$ any value up to
10, and $n$ any value above 20. In no case will there be a difference in the
result greater than a unit in the fourth decimal place, which is, of course,
negligible. For the conditions in crossing over, therefore, we can make
this substitution, and with slight change in the arrangement of the terms
we obtain from (17) the following:

$$B_d = \left(\frac{nb}{1-b}\right)^d \frac{\sqrt{1-2C}}{d!} \tag{18}$$
Now, as we have seen earlier, when $n$ = 1, $b = C$, so that in this case
$$\frac{nb}{1-b} = \frac{C}{1-C}$$

For any higher value of $n$, with the same value of $C$, we have by equation (3),

$$n = \frac{\log(1-2C)}{\log(1-2b)}$$

From this equation we can obtain the values of $b$ and of $1-b$, in terms
of $n$ and $C$. The steps may be indicated as follows. We have directly
$$\log(1-2b) = \frac{\log(1-2C)}{n}$$
Now, if $\log(1-2b)$ is equal to $\log(1-2C)$ divided by $n$, it of course
follows that $1-2b$ is the $n$th root of $1-2C$; that is

$$1-2b = \sqrt[n]{1-2C}$$
whence
$$b = \frac{1-\sqrt[n]{1-2C}}{2}$$
and
$$1-b = \frac{1+\sqrt[n]{1-2C}}{2}$$

From which it follows that
$$\frac{nb}{1-b} = \frac{n\left(1-\sqrt[n]{1-2C}\right)}{1+\sqrt[n]{1-2C}}$$

and therefore
$$B_d = \left(\frac{n\left(1-\sqrt[n]{1-2C}\right)}{1+\sqrt[n]{1-2C}}\right)^d \frac{\sqrt{1-2C}}{d!} \tag{19}$$

Now, in the expression $\sqrt[n]{1-2C}$, the quantity under the radical is
necessarily a proper fraction (since $C$ is less than $\tfrac{1}{2}$). As we extract successively
higher roots of a proper fraction, it becomes greater, approaching 1
as a limit. Thus, for high values of $n$, the expression $\sqrt[n]{1-2C}$ approaches
1 as a limit. Hence, in the first member of the above value for $B_d$, the
denominator, $1+\sqrt[n]{1-2C}$, approaches 2 as a limit, as $n$ becomes large.

In the case of the numerator, $n\left(1-\sqrt[n]{1-2C}\right)$, if the numerical values
of this expression are worked out for a given value of $C$ with successively
higher values of $n$, it becomes obvious that it is approaching a limit. Now
the limit of the expression $n\left(\sqrt[n]{x}-1\right)$ is the natural logarithm of $x$. Our
expression is of the same form, with $(1-2C)$ taking the place of $x$, and with
the signs reversed. That is, the limit of $n\left(1-\sqrt[n]{1-2C}\right)$ is the minus
natural logarithm of $(1-2C)$.* Since $1-2C$ is itself a proper fraction, the
minus natural logarithm is itself a positive quantity.
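
The approach of the numerator and denominator to their limits may be watched numerically. The following Python sketch is illustrative (the function names are assumptions); it works out both expressions for $C$ = .25 with successively higher values of $n$.

```python
from math import log


def numerator(C: float, n: int) -> float:
    """n * (1 - nth root of (1 - 2C)); its limit is minus the natural log of (1 - 2C)."""
    return n * (1.0 - (1.0 - 2.0 * C) ** (1.0 / n))


def denominator(C: float, n: int) -> float:
    """1 + nth root of (1 - 2C); its limit is 2."""
    return 1.0 + (1.0 - 2.0 * C) ** (1.0 / n)


if __name__ == "__main__":
    C = 0.25
    print("limit of numerator:", -log(1.0 - 2.0 * C))
    for n in (10, 100, 1000, 10 ** 6):
        print(n, numerator(C, n), denominator(C, n))
```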

Substituting the two limits we have thus obtained, in formula (19),
we get:

$$B_d = \frac{[-\text{nat log}(1-2C)]^d}{2^d}\cdot\frac{\sqrt{1-2C}}{d!} \tag{20}$$
We have now obtained an expression for the value of $B_d$ corresponding
to any given value of $C$, that is quite independent of the particular values
of $n$ and of $b$, provided that $n$ is large and $d$ is small (as we know to be the
case).

* I am indebted to Professor L. S. HULBURT of the Mathematical Department of THE JOHNS
HOPKINS UNIVERSITY, for pointing this out to me. The natural logarithm of any number is of
course obtained from its common logarithm by multiplying the latter by 2.30258509. In
practice, the natural logarithm of a decimal fraction is obtained by taking its logarithm from a
table, subtracting 10 from it, and multiplying the remainder by 2.30258509.

An example will make clear the use of formula (20). If the crossover
ratio is 25 percent, in what proportion of the cases are there three breaks
between the two extreme genes? Here $d$ = 3, while $C$ = .25; so that the
proportion of cases yielding $B_3$ is:

$$B_3 = \frac{(-\text{nat log } .50)^3}{2^3}\cdot\frac{\sqrt{.50}}{3!} = \frac{.69314719^3}{8}\times\frac{.70710678}{6} = .004906, \text{ or } .49 \text{ of 1 percent}$$

The values given by this formula are the limiting values in case $n$ is a
large number, and this condition is fulfilled when the frequency of a break
between two contiguous genes is small; that is, when $b$ is small. We
know that $b$ is certainly smaller than .005. The lowest possible values of
$n$ will therefore be higher than those corresponding to the value of .005
for $b$; these may be obtained for any value of $C$ by equation (3), page 408.
If now we work out by formula (13), page 412, the values of $B_0$, $B_1$, $B_2$, $B_3$,
etc., for $b$ = .005, we find that these do not differ, for any possible value of
$C$, or for any number of breaks from 0 to 5, by any appreciable amount
from the values given by the general formula (20); in no instance is the
difference as great as one-tenth of one percent. For example, for 25 percent
crossing over, if we take the value of $b$ to be .005, and determine by
(3) the number of joints necessary to produce this crossover ratio, then
work out directly by the binomial formula (13) the proportion of cases
yielding 0, 1, 2 and 5 breaks, we obtain the result given as (A) below, as
compared with the proportions in (B), worked out by the general formula
(20):

Number of breaks                        0          1          2           5
(A) from binomial formula (13)       .70671     .24507     .042177     .0003955
(B) from general formula (20)        .70710     .24594     .042467     .0004251
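
The general formula (20) is easily computed for any crossover ratio and any number of breaks. What follows is a sketch only, with an illustrative function name; it reproduces the worked case of three breaks at a crossover ratio of 25 percent.

```python
from math import factorial, log, sqrt


def breaks_proportion_limit(C: float, d: int) -> float:
    """Formula (20): B_d = (-nat log(1 - 2C) / 2)^d * sqrt(1 - 2C) / d!."""
    half_minus_log = -log(1.0 - 2.0 * C) / 2.0
    return half_minus_log ** d * sqrt(1.0 - 2.0 * C) / factorial(d)


if __name__ == "__main__":
    print(f"{breaks_proportion_limit(0.25, 3):.6f}")
    # Percentages for 0 to 4 breaks at a crossover ratio of 25 percent.
    print([round(100 * breaks_proportion_limit(0.25, d), 4) for d in range(5)])
    # Since sqrt(1 - 2C) = exp(-m), with m the half minus-logarithm above, the
    # terms have the form exp(-m) * m^d / d!, and their sum over all d is 1.
    print(sum(breaks_proportion_limit(0.25, d) for d in range(40)))
```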


The values given at (A) are lower than the lowest possible limits, those 
at (B) are the highest possible limits, for the proportions. Such minute 
differences are obviously negligible, so that the values obtained from 
formula (20) may be taken as the correct proportions of cases showing 
the different number of breaks. In table 4 are given these proportions 
for a series of values of the crossover ratio C. 

Table 4 shows in general that for moderate values of the crossover ratio, 
the number of breaks to be expected is small, much as appears to be the
case in nature, if interpreted on the linear theory. More precise comparisons
are found in the following section.


TABLE 4

Linear theory; no interference. Showing for typical diverse values of the crossover ratio C, the distance
apart of the genes (in units each of a length to give a crossover ratio of 1 percent), computed
by formula (6), and the percentages of the cases in which there would be
0, 1, 2, 3, 4 or 5 breaks between the two genes yielding the given
ratio C computed from formula (20).

          DISTANCE IN     PERCENTAGE THAT SHOWS THE INDICATED NUMBERS OF
   C      UNITS OF 1      BREAKS, FROM 0 TO 4 BETWEEN THE TWO GENES
          PERCENT            0         1         2         3         4          5
   1         1.00       [illegible]
   5         5.22         94.868     4.998     0.132     0.002    0.00003
  10        11.05         89.443     9.979     0.557     0.021    0.0006     ........
  15        17.66         83.666    14.928     1.330     0.079    0.0035     0.000001
  20        25.29         77.428    19.858     2.497     0.205    0.0124     0.000006
  25        34.31         70.711    24.507     4.247     0.491    0.0425     0.000029
  30        45.36         63.199    29.074     6.615     0.992    0.1104     0.000106
  35        59.60         54.795    33.046     9.905     1.963    0.2885     0.000361
  40        79.67         44.721    35.988    14.480     3.884    0.7814     0.001258
  45       113.97         31.623    36.407    20.958     8.043    2.315      0.005330
  49       193.64         14.141    27.598    26.930    17.518    8.547      0.033743
  49.9     307.61          4.472    13.928    21.590    22.362   17.372      0.107957








COMPARISON OF THE CONDITION RESULTING FROM SERIAL ARRANGEMENT 
OF THE GENES, IF THERE WERE NO INTERFERENCE, WITH THOSE FOUND 
IN NATURE 


By the use of the formulae and tables set forth above, it becomes a 
simple matter to determine how far the conditions demanded by the 
linear theory, would, if there were no interference, agree with those 
found in experimental breeding,—as summarized in propositions 1 to 17, 
pages 397-399. It will be well to examine these relations here, for com- 
parison later with the more complex and less perspicuous case in which 
there is interference. For this purpose it will be helpful to tabulate the 
crossover ratios for a number of genes, A to H, that are successively 10 
units apart (each unit of length to yield a one-percent crossover ratio), on 
the assumption that there is no interference. This is of course readily 
done by the use of formula (8). The results are shown in table 5. 

Table 5 shows that if there were no interference, all of the general rela- 
tions set forth in our fundamental propositions 1 to 17 (pages 397-399) 
as requiring explanation would be fully accounted for, except that numbered
17. We have already seen that the linear theory accounts immediately
for proposition 1 (page 397) and propositions 12 to 15 (page 398).
We saw also that the distances between objects in serial arrangement 
show the relations numbered 2, 3, 4, 6, 7, 8, 9, but do not show accurately 
those numbered 5, 10 and 11 (see page 397, and table 3). Examination 
of table 5 shows that when we take the required crossover ratios, in place 
of the distances, all these relations (propositions 1 to 11) are fully ac- 
counted for. In the case of propositions 5, 10 and 11, which the distances 
did not correctly fit, the required ratios (table 5) show precisely the con- 
ditions found in the ratios discovered in breeding. The theory requires 


TABLE 5

Crossover ratios for eight genes, A to H, if they are separated by successive stretches each 10 units in
length (when there is no interference).

         A       B       C       D       E       F       G       H
A       ...     9.2    16.8    22.9    27.5    31.6    35.2    37.7
B       9.2     ...     9.2    16.8    22.9    27.5    31.6    35.2
C      16.8     9.2     ...     9.2    16.8    22.9    27.5    31.6
D      22.9    16.8     9.2     ...     9.2    16.8    22.9    27.5
E      27.5    22.9    16.8     9.2     ...     9.2    16.8    22.9
F      31.6    27.5    22.9    16.8     9.2     ...     9.2    16.8
G      35.2    31.6    27.5    22.9    16.8     9.2     ...     9.2
H      37.7    35.2    31.6    27.5    22.9    16.8     9.2     ...








precisely that, as in proposition 5, the ratio of gene A to gene C should 
not be just equal to the sum or the difference of ratios A to B and B to C, 
but should be either somewhat less than the sum, or somewhat more than
the difference; this will be seen by examination of the ratios in table 5. In the
same way, the theoretical ratios of table 5 show precisely the peculiar 
relations of the actual ratios that are formulated in propositions 10 and 
11. Further, table 4 shows that, as set forth in proposition 16, a consider- 
able proportion of the chromosomes would be found that show no breaks 
whatever even between genes lying far apart (as indicated by the high 
crossover ratio they yield). 

It remains to inquire, as set forth on page 403, how far the requirements
of the linear theory, without interference, are consistent with the actual 
numerical values of the ratios discovered in the experimental studies, 
and with the actual numbers of breaks in the chromosome that observa- 
tion, interpreted on the linear theory, shows to occur. We shall at the 
same time inquire how far the distances apart of the genes, represented 
on the maps constructed by the students of Drosophila, are consistent 
with this form of the linear theory.
With regard to the observed numerical values of the ratios, we find, by
application of formula (10), page 411, that the actual ratios for genes 
that are, on the linear theory, far apart, are systematically too high in 
comparison with the ratios for the partial distances of which these longer 
stretches are made up, so that they could not be accounted for on the 
linear theory without interference. For making these comparisons, it is 
necessary to employ only ratios that have been determined with sufficient 
accuracy to be fairly dependable; we will arbitrarily select as such those 
based on statistics from at least 5000 cases. We find determinations based 
on numbers above 5000 for several stretches of chromosome I of Drosophila
given by MORGAN and BRIDGES (1916), and certain additional ones
are obtainable from the data of PLOUGH (1917). For the shorter stretches
of chromosome I we find the following ratios in MORGAN and BRIDGES:
yellow-white, 1.1; white-vermilion, 30.5; vermilion-miniature, 3.1; 
miniature-rudimentary, 17.9. 

If now we compute from these the ratios for longer stretches required 
by the linear theory if there is no interference, we find these required 
ratios to be the following, in comparison with the observed ratios: 


                          Required    Observed
Yellow-miniature            31.6        34.3
White-rudimentary           38.2        42.4


Again, we find another set of satisfactorily determined shorter stretches, 
as follows: White-vermilion, 30.5; vermilion-sable, 10.1; sable-bar, 13.8; 
bar-fused, 2.5. 

Computing from these the required ratios for the longer stretches, and 
comparing with the observed, we have: 


                          Required    Observed
White-bar                   38.7        43.6
Vermilion-bar               21.1        23.9
Vermilion-fused             22.5        25.8


From PLOUGH (1917) we have the following additional data: sable-garnet,
1.4; garnet-forked, 12.0; giving:

                          Required    Observed
Vermilion-forked            20.0        22.5


For chromosome II, BRIDGES and MORGAN (1919) give the following
ratios, each based on more than 5000 cases: star-black, 37.9; black-curved,
22.7; curved-speck, 30.2; yielding for longer stretches:


                          Required    Observed
Star-speck                  47.4        48.3
Star-curved                 43.4        45.9

With the additional ratio black-purple 6.2, we obtain:


                          Required    Observed
Star-purple                 39.4        43.7


We have also the ratios dachs-black, 17.8; black-vestigial, 17.8, which 
yield: 

                          Required    Observed
Dachs-vestigial             29.3        29.6


Further, from the ratios black-arc, 42.6; arc-morula, 7.9; purple-curved, 
19.9; and curved-speck, 30.2, we obtain the comparisons: 


                          Required    Observed
Black-morula                43.8        46.6
Purple-speck                38.1        45.7


For the third chromosome, GOWEN (1919) gives the following ratios 
(each based on 31,456 cases): 
Sepia-dichaete, 11.5; dichaete-spineless, 9.9; spineless-sooty, 10.6; sooty- 
rough, 20.3. 

For the longer stretches these yield the following comparison of required 
and observed ratios: 


                          Required    Observed
Sepia-rough                 3,9         39.1
Dichaete-rough              32.0        36.1
Sepia-sooty                 25.7        26.2


Thus, it is a fact that the crossover ratios between genes that are (on the 
linear theory) far apart, are found to be regularly greater than is possible 
on the linear theory without interference, when we consider what the 
ratios are for shorter intervening stretches taken separately. It is there- 
fore clear that the linear theory without interference cannot account for 
the actual numerical relations found among the ratios. 
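
The comparisons above may be retraced with a short computation. Formula (10) is not reproduced in this part of the paper; the sketch below assumes it has the usual form for independent breaks, in which the quantity (1 - 2C) for a long stretch is the product of the same quantity for its component stretches. The function name is ours, for illustration only.

```python
def combine_without_interference(segment_ratios_percent):
    """Crossover ratio (percent) of a stretch made of contiguous segments,
    ASSUMING breaks in the several segments occur independently, so that
    (1 - 2C) for the whole is the product of (1 - 2C_i) for the parts."""
    product = 1.0
    for ratio in segment_ratios_percent:
        product *= 1.0 - 2.0 * ratio / 100.0
    return 100.0 * (1.0 - product) / 2.0


# Segment ratios (percent) as quoted in the text.
white_bar = combine_without_interference([30.5, 10.1, 13.8])
dachs_vestigial = combine_without_interference([17.8, 17.8])
star_purple = combine_without_interference([37.9, 6.2])

for name, required, observed in [
    ("White-bar", white_bar, 43.6),
    ("Dachs-vestigial", dachs_vestigial, 29.6),
    ("Star-purple", star_purple, 43.7),
]:
    print(f"{name:16s} required {required:5.1f}   observed {observed:5.1f}")
```

Under this assumption the required values fall below the observed ones in each of the three cases, which is the relation the tables above set forth.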

As to the actual numbers of breaks occurring between given distant
genes of a chromosome, determinations, based on the theory of the linear
arrangement of the genes, have been made by MULLER (1916), PLOUGH
(1917), WEINSTEIN (1918) and GOWEN (1919), for the first, second and
third chromosomes of Drosophila melanogaster. Certain unpublished
determinations for the first chromosome of Drosophila virilis have been
furnished me, further, by WEINSTEIN. To compare the proportions of
these observed numbers of breaks with the proportions required by the
linear theory without interference, the observed proportions having no
break ($B_0$) must be taken as the point of reference, since these (as set
forth later) are not altered by interference. From equations (16) and (20)
a formula is readily obtained for determining the proportion ($B_d$) of cases
showing any given number of breaks d, corresponding to the proportion
with no breaks ($B_0$); it is:

$$B_d = \frac{(-\text{nat log } B_0^{2})^{d} \cdot B_0}{2^{d} \cdot d!}$$

In order to avoid repetition, the results of this comparison are tabulated 
on a later page, in connection with those of a similar comparison for 
different extents of interference, in table 13, page 449. As will be found on 
examination of this table, in every case the proportion of cases with one 
break is in nature much too high, while the proportion with more than one 
break is much too low, as compared with what is required by the linear 
theory without interference. For example, in GOWEN’s study of the 
numbers of breaks in the third chromosome of Drosophila, based on 
31,456 cases, with 54.6 percent showing no break, the proportion required 
by the theory is 33.1 percent with one break, while the observed propor- 
tion is 38.9 percent. On the other hand, for two breaks the theory requires 
10.0 percent, while only 6.3 are found; theory requires 2.0 percent with 
three breaks, but only 0.3 percent are found; and these results are typical. 
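
The figures quoted from GOWEN's data can be retraced from the proportion with no break alone. The following is a minimal sketch of the formula just given, in which the function name is ours and the observed proportions are those stated in the preceding paragraph.

```python
import math


def breaks_without_interference(b0, d):
    """Proportion of cases with d breaks required without interference,
    given the proportion b0 having no break:
    B_d = (-ln(b0^2))^d * b0 / (2^d * d!)."""
    return (-math.log(b0 ** 2)) ** d * b0 / (2 ** d * math.factorial(d))


b0 = 0.546                                  # third chromosome, no break
required = [breaks_without_interference(b0, d) for d in range(4)]
observed = [0.546, 0.389, 0.063, 0.003]     # as quoted in the text

for d, (req, obs) in enumerate(zip(required, observed)):
    print(f"{d} breaks: required {100 * req:5.1f}%   observed {100 * obs:5.1f}%")
```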

Thus the linear theory without interference will not account for the 
proportions having the different numbers of breaks actually observed; the 
latter show an excess of those with but 1 break; a deficiency of those with 
2 or more breaks. 

We may inquire finally as to the consistency of the “map distances” 
between the genes, given by the students of Drosophila, with the linear 
theory without interference. These map distances are of course not 
matters of direct observation, but they are determined by such careful 
methods that we may accept them as approximately what must be the 
actual relations, if the genes are arranged serially. Table 6 shows the
“map distances” given by BRIDGES (1921) for the more important genes
of chromosomes I and II of Drosophila, in comparison with the distances
required by the linear theory without interference. The map distances
are of course given in units, each of such length as to yield by itself a
1-percent crossover ratio. For chromosome I, I have used the distances
from the gene white instead of the gene yellow, since the crossover ratios
from the former have been much more accurately determined. These
ratios are given in table 2. The ratios for the second chromosome, from
the end gene, star, are those given by BRIDGES and MORGAN (1919).
From these ratios the number of units required by the linear theory without
interference is computed from formula (6).

As table 6 shows, if there were no interference, the distances (measured
in units of length to give 1 percent crossing over) required to give the
observed crossover ratios would be much greater than those determined
by BRIDGES in the chromosomes of Drosophila.
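
The distances required without interference can be sketched numerically. Formula (6) is not reproduced in this part of the paper; the code below assumes it has the form n = log(1 - 2C) / log(1 - 2b), with b = .01 for units that each yield 1 percent of crossing over. The function name and the default argument are ours. The ratios used are the observed second-chromosome ratios from star quoted earlier in this section.

```python
import math


def units_without_interference(crossover_percent, unit_percent=1.0):
    """Number of units (each yielding `unit_percent` crossing over by
    itself) needed to give the stated crossover ratio if breaks occur
    independently. ASSUMED form: n = log(1 - 2C) / log(1 - 2b)."""
    c = crossover_percent / 100.0
    b = unit_percent / 100.0
    return math.log(1.0 - 2.0 * c) / math.log(1.0 - 2.0 * b)


star_black = units_without_interference(37.9)
star_curved = units_without_interference(45.9)
star_speck = units_without_interference(48.3)

print(f"star-black  {star_black:6.1f} units")
print(f"star-curved {star_curved:6.1f} units")
print(f"star-speck  {star_speck:6.1f} units")
```

Under this assumption the three values agree closely with the entries for Black, Curved and Speck in the "required" column of table 6.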

In sum, therefore, we find that if the genes were arranged in linear series, 
and crossing over took place by breaking and exchange of partners, and 
if a break at a given joint did not interfere with a break elsewhere, then 
all of the general relations found to hold among the observed ratios, 
summarized in our basic propositions 1 to 17 (page 397), would necessarily 
hold, except the last one (number 17). 

TABLE 6
Comparison of the map distances given by BRIDGES (1921) with those required by the linear theory
without interference.

             CHROMOSOME I                            CHROMOSOME II

              Map distances  Required (no                Map distances  Required (no
Gene          BRIDGES 1921   interference)   Gene        BRIDGES 1921   interference)

White              0.0            0.0        Star             0.0            0.0
Bifid              5.8                       Streak          15.4           18.2
Club              15.2           16.7        Dachs           29.1           42.5
Vermilion         31.5           46.6        Black           46.5           70.2
Miniature         34.6           54.0        Purple      [illegible]       102.8
Sable             41.5           86.0        Curved          73.5          123.8
Rudimentary       53.0           93.2        Speck          105.3          167.4
Bar               55.5          101.8





But in addition to this discrepancy as to proposition 17, we find three 
discrepancies as to the numerical relations involved. These are: 

(1) The observed numerical values of the ratios could not occur; the 
observed ratios for distant genes are regularly greater than they would 
be on the assumptions set forth above. 

(2) The proportions of the chromosomes showing the different numbers 
of breaks, from 0 to 3 or 4, would not be those actually found; the latter 
show a marked excess in the proportion of those with 1 break, and a corre- 
sponding deficiency of those with 2 or 3 or more breaks. 

(3) The “map distances” of the genes that have been determined in the 
chromosomes of Drosophila are much less than are required by the linear 
theory without interference. 


THE HYPOTHESIS OF INTERFERENCE 


To account for the relation set forth in proposition 17, namely, that
when a crossover occurs between two genes having together a low crossover
ratio, no crossover occurs between one of these genes and any other gene
having a low ratio with it and lying on the same side of the crossing-over
break, in the series given by such a table as table 2, we know that the
students of Drosophila have proposed the hypothesis of interference.
This asserts that the occurrence of a break at a certain joint in the linear 
series interferes in some way with the occurrence of another break at any 
joint near by. Various suggestions have been made as to how such inter- 
ference might be brought about; with these we are not concerned. 

The hypothesis of interference was proposed only to account for the
relation set forth in our proposition 17, not at all with reference to the
three numerical discrepancies above set forth. We have now to examine
whether it would in fact do away with these three discrepancies; whether, 
in other words, interference would (1) increase the crossover ratios of 
distant genes in proportion to those of genes close together; (2) increase 
the proportion of cases with a single break at the expense of those with 
more than one break; (3) decrease the “map distances” in the chromo- 
somes. Would it have these effects to just the extent required to bring 
the facts into consonance? And of course it might turn out that inter- 
ference would disturb some of the other relations, set forth in basic 
propositions 1 to 16: this would be fatal. 

It is obvious that the fulfilment of these conditions imposes a severe 
test on the theory of interference. To this test we now turn. 


CROSSING OVER WITH INTERFERENCE, IN CASE THE GENES ARE ARRANGED 
IN A SERIES 

We next examine the conditions which must hold if crossing over occurs 
in linear series of genes in the method set forth in the linear theory, but the 
occurrence of a break at a certain joint interferes with the occurrence of 
another break at any joint within a certain distance. 

When there is no interference, the proportions of the chromosomes
showing the diverse numbers of breaks, from 0 up, between genes
separated by n joints, are given, as we have seen, by the successive terms of
the binomial $(a+b)^n$, where b is the frequency of a break at any joint,
and a = 1 - b.

Giving, for example, the value 4 to n, we have the following proportions:

Number of breaks     0         1           2            3          4
Proportions        $a^4$  +  $4a^3b$  +  $6a^2b^2$  +  $4ab^3$  +  $b^4$  =  1
Now, in case there is interference, it is evident that this formula suffers
the following transformations:

(1) The proportion showing 0 breaks ($a^n$) remains unaltered, for interference
is effective only after the chromosome concerned has a break.

(2) All those that would give a single break if there were no interference
(that is, the second term, $n a^{n-1} b$) still give a single break, since
interference does not come into action until after there is one break.

(3) The proportions that would give respectively 2, 3, etc., breaks,—that 
is, the terms beyond the second—are now reduced, some of their compo- 
nent parts giving in case of interference a smaller number of breaks. 

Thus, those chromosomes which would have 2 breaks (the proportion
of which is $6a^2b^2$ in the above instance) will some of them, if there is
interference, have but 1 break. All those in which the two breaks would
be within the distance over which interference occurs, will now have but
one break; the rest will retain two breaks.

Similarly, for the proportion (fourth term) that would otherwise have 
3 breaks; if all these three breaks are within the distance over which 
interference is effective, there will be but 1 break; if but two are within 
this distance, these two will be reduced to 1, while the third will remain; 
so that the chromosome will have but 2 breaks in place of 3. In the same 
way, some that would have 4 breaks will now be reduced to 3, 2 or 1 
break; and so of all the other terms. 

(4) That is, the proportion that has just one break will in case of inter- 
ference, be increased by contributions from all the succeeding terms. The 
proportion for any greater number, as 2 breaks, will be decreased by the 
contributions which it makes to those having but 1, but increased by 
contributions from all the terms for higher numbers of breaks. 

It is thus obvious that among the general results of interference as 
compared with the results if there were no interference, are the following: 
(a) the proportion of chromosomes with no breaks is unaltered; (b) the 
proportion with a single break is increased. This, as we saw on an earlier 
page (421), is what is required to bring the results as to numbers of breaks 
into consonance with the linear theory. Whether it will do this in the 
correct proportions, and whether it will, as required, increase the crossover 
ratios between distant genes in comparison to those between genes that 
are near together, and decrease the map distances; also whether it will in 
any way disturb the consonance of the observed relations with the linear 
theory,—these remain to be determined. 
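
The transfer of cases from the later terms of the binomial to the earlier ones can be illustrated by direct enumeration for the instance n = 4. The sketch below is ours, not part of the original argument: it assumes that the joints are examined in serial order, and that a break suppresses any break at the next `span - 1` joints (the parameter `span` is a hypothetical name for the extent of interference).

```python
from itertools import product


def tally_breaks(n, b, span):
    """Enumerate every pattern of break / no-break over n joints.
    Returns (without, with_interference): proportions having 0..n breaks,
    first as the patterns stand, then after suppressing any break that
    falls within `span - 1` joints after a retained break."""
    a = 1.0 - b
    without = [0.0] * (n + 1)
    with_interference = [0.0] * (n + 1)
    for pattern in product((0, 1), repeat=n):
        p = 1.0
        for is_break in pattern:
            p *= b if is_break else a
        without[sum(pattern)] += p
        kept, last_kept = 0, None
        for joint, is_break in enumerate(pattern):
            if is_break and (last_kept is None or joint - last_kept >= span):
                kept += 1
                last_kept = joint
        with_interference[kept] += p
    return without, with_interference


without, with_interference = tally_breaks(n=4, b=0.1, span=2)
for d in range(5):
    print(f"{d} breaks: {without[d]:.4f} -> {with_interference[d]:.4f}")
```

In this small instance the proportion with no break is unaltered and the proportion with a single break is increased, as stated under (a) and (b) above.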

To determine these things, we require formulae giving the crossover
ratios and the numbers of breaks occurring, in case there is interference;
we can then compare the results with those from crossing over without
interference. A general formula for the numbers of breaks occurring in
case of interference would be obtained from the binomial expression
$(a+b)^n$, by determining what part of each of the terms is to be transferred
to the terms preceding it. What part of those that without interference
would show two breaks, will now show but one? And so of the
remaining terms.


Complete interference 


We shall deal first with the relatively simple possible case that a single
break in the chromosome prevents the occurrence of another break in this
entire chromosome. As before, we let b represent the frequency of a break
at any particular joint so long as interference has not come into operation,
while a represents the frequency of no-break at any particular joint, under
conditions of no interference. The total number of joints in the chromosome
we designate n; interference extends over the entire n joints.

When there is no interference, as we have seen, the relative numbers of
chromosomes showing the different possible numbers of breaks, from 0
up, are given by the successive terms of the binomial $(a+b)^n$. If, when
there has occurred a single break, no other occurs, it is obvious that all
the chromosomes will now show a single break, except those which, in
case there were no interference, would show no break. The proportion
of these latter is given by the first term of the binomial $(a+b)^n$; it is of
course $a^n$. The proportion of the chromosomes showing a single break is
therefore, in the case of complete interference, $1-a^n$; and this of course
will be the crossover ratio, as well as the value of $B_1$ (the proportion of
cases having a single break). We therefore have for the case of complete
interference for the entire length of the chromosome the formula:

$$C = B_1 = 1 - a^n \qquad (21)$$

From this formula it is clear that, if interference is complete throughout
the length of the chromosome, C may have any value between 0 and 1.
Crossover ratios above 50 percent may occur as readily as ratios below
50 percent. For whatever value be given to a (the proportion of cases in
which a given joint is not broken, so long as interference has not come into
action), if n be taken sufficiently great $a^n$ will become less than 1/2, so
that C will be greater than 1/2 or 50 percent. Thus, if for b we take the
super-maximal value of .005, so that a = .995, then for n = 138.28, C = 1/2
or 50 percent; for n = 277, C = 75 percent; for n = 455, C = 90 percent. In
general the number of joints n, between the two extreme genes, necessary
to give for them any particular crossover ratio C is given by transforming
formula (21) as follows:

$$1 - C = a^n$$

Whence

$$n = \frac{\log (1-C)}{\log a} \qquad (22)$$

In formula (22) of course n is the total number of joints in the chromosome,
and C the crossover ratio for the two end genes.
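
Formulae (21) and (22) are readily put in computable form. The following is a minimal sketch, with function names of our own choosing, using the super-maximal value a = .995 of the text.

```python
import math


def crossover_ratio_complete(n, a):
    """Formula (21): C = 1 - a^n, complete interference over n joints."""
    return 1.0 - a ** n


def joints_for_ratio(c, a):
    """Formula (22): n = log(1 - C) / log(a)."""
    return math.log(1.0 - c) / math.log(a)


a = 0.995
n_half = joints_for_ratio(0.50, a)
n_three_quarters = joints_for_ratio(0.75, a)

print(f"C = 50 percent requires n = {n_half:.2f}")
print(f"C = 75 percent requires n = {n_three_quarters:.2f}")
print(f"C for n = 277: {crossover_ratio_complete(277, a):.3f}")
```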

In the case of complete interference, if any given joint shows a break
as frequently as any other, the actual mean frequency of a break (which
we may call β, to distinguish it from b, the frequency when there is no
interference) is, of course:

$$\beta = \frac{1 - a^n}{n} \qquad (23)$$

And for genes separated by any number, m, of joints the crossover ratio
is the frequency of a single break in the total m joints, so that:

$$C = B_1 = \frac{m(1 - a^n)}{n} \qquad (24)$$

(Where n is the total number of joints in the chromosome.)

In this case, for comparing the crossover ratios of two stretches of the
chromosome, it is evident from (24) that their ratios are directly proportional
to the number of joints they contain. That is, if $C_1$ is the ratio
for $m_1$ joints, then for $C_2$, the ratio for $m_2$ joints, we have

$$\frac{C_2}{C_1} = \frac{m_2}{m_1}, \quad \text{whence} \quad C_2 = \frac{m_2 C_1}{m_1} \qquad (25)$$

And of course the crossover ratio for the sum of two or more contiguous
stretches of the chromosome is the sum of the ratios for the several
stretches taken separately. The value of the crossover ratio between any
two genes would be the same as the number of units (each yielding 1 percent
of crossing over) between them.
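
The additive property just stated follows at once from formula (24), and may be checked numerically. In the sketch below the chromosome of 100 joints and the stretches of 20, 30 and 50 joints are hypothetical values chosen only for illustration.

```python
def ratio_complete_interference(m, n, a):
    """Formula (24): crossover ratio for a stretch of m joints within a
    chromosome of n joints, under complete interference."""
    return m * (1.0 - a ** n) / n


n, a = 100, 0.995            # hypothetical chromosome, for illustration
c_20 = ratio_complete_interference(20, n, a)
c_30 = ratio_complete_interference(30, n, a)
c_50 = ratio_complete_interference(50, n, a)

print(f"20 joints: {c_20:.4f}")
print(f"30 joints: {c_30:.4f}")
print(f"50 joints: {c_50:.4f}   sum of the two parts: {c_20 + c_30:.4f}")
```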

No cases are thus far known of complete interference; the observations
show that, interpreted on the linear theory, some of the chromosomes
show more than a single break. We proceed therefore to examine:


Partial interference


In partial interference, the occurrence of a break at a given joint interferes
with another break within a certain distance from that joint, but
this distance is not so great as to include the entire length of the chromosome.
Hence, two or more breaks may occur in the chromosome, depending
on its length.

The generalized problem of partial interference is complicated by the
fact that the results differ slightly, depending on the order in which the
fate of the different joints of the chromosome is determined. This may be
illustrated by a simple example. Suppose that when a break occurs, this
prevents the occurrence of another break for three joints in each direction.
Then if in the 6 joints shown below, there would without interference
occur three breaks (b), at joints 1, 3 and 5,

1  2  3  4  5  6
b     b     b

as shown, the number of breaks remaining in case there is interference
will depend upon which of the three breaks is produced first. If number 1
is produced first, this will interfere with number 3, but not with number 5,
so that there will remain two breaks. But if number 3 is produced first,
this will interfere with both the other two breaks (since these are both
within 3 joints of it), so that there would now remain but a single break.
The order of determination affects in other ways also the proportions
showing particular numbers of breaks.
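
The dependence on the order of determination in this example may be traced step by step. The following sketch is ours: a break is retained only if no previously retained break lies within three joints of it, the would-be breaks being taken up in the order given.

```python
def surviving_breaks(order, reach=3):
    """Take the would-be breaks in the stated order; retain a break only
    if no already retained break lies within `reach` joints of it."""
    kept = []
    for joint in order:
        if all(abs(joint - other) > reach for other in kept):
            kept.append(joint)
    return sorted(kept)


first_break_at_1 = surviving_breaks([1, 3, 5])
first_break_at_3 = surviving_breaks([3, 1, 5])

print("break 1 produced first:", first_break_at_1)
print("break 3 produced first:", first_break_at_3)
```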

If, however, the linear theory holds for the chromosomes, it is extremely
probable that the determination of the fate of the joints occurs in serial
order, from one end of the chromosome to the other. For the cytological
evidence indicates that when the conjugation of the chromosomes occurs,
in the very long thread-like stage, the union begins at one end and progresses
toward the other; and the separation of the two united chromosomes
appears to take place in a similar manner. The determination of the
fate of the joints must apparently occur either when the two chromosomes
are uniting, or when they are separating.

It appears therefore that a solution of the problem for determination
of the fate of the joints successively in linear order, is what we require.
In any case, however, it appears that when the frequency of breaks is
small, and the number of joints is large, as we know to be the case in the
chromosome (if the linear theory holds), the difference in the results due
to different orders of determination would be so minute as to be entirely
negligible. A number of typical particular diverse orders worked out
gave results differing only in the fourth or fifth decimal place or beyond.*

* If it be held that the first break occurs as frequently at any one joint as at any other, and
that any given order of determining the fate of the entire n joints occurs as frequently as any
other; then there would be n! equally frequent orders of determination. By transforming, in
accordance with these principles, the binomial $(a+b)^n$ into an expression giving the mean result
for all these n! orders of determination, in case of interference extending for any given number of
joints, a solution under these conditions would be obtained. Apparently such a solution would
not differ in numerical results appreciably from that given in the text.

We assume then that the determination of whether each joint is to
show a break or no break occurs in serial order, from one extremity of the
chromosome to the other. Thus in the series

1  2  3  4  5  6  7


the fate of number 1 is first determined, then that of number 2, and so on. 
(Determination proceeding serially from the middle in both directions, as 
might occur in V shaped chromosomes, would give essentially identical 
results.) 

We shall, as in the case of no interference, first develop formulae in
which b, the frequency of a break at any given joint, and n, the number
of joints, are assumed to be known; from these we shall then obtain general
formulae showing the relations even when b and n are not known.

It is convenient to designate as k the number of joints whose fate is
determined (in one direction, of course, if the order of determination is
serial), when a break occurs at a given joint, including in k the joint at
which a break occurs. Thus, if, when a break occurs, this prevents a
break in the four succeeding joints, the value of k is 5.

The problem of the results of interference in a series of joints, in case
n is greater than the distance of interference, may be attacked directly, in
the following manner.

The chromosome has n joints between its n+1 genes. At each joint, so
long as interference does not come into operation, a break occurs in the
fraction b of all cases; no-break in the fraction a (= 1 - b). Whenever the
fate of a certain joint is determined as no-break, this has no effect on the
fate of succeeding joints. But when a certain joint receives a break, then
a certain number (k-1) of the succeeding joints are determined as yielding
no break, so that at any break a total of k joints is determined.

For convenience of illustration, we will designate a break by the letter
b; no-break (when independently determined) by the letter a; these will
also represent the proportions of cases in which these conditions occur.
We may further designate a “no-break” which is determined by a foregoing
break, by the Greek letter α. If then, for example, n = 8, k = 4,
and the first break occurs at the third joint, while no other occurs, we
represent the condition as follows:

a a b α α α a a

And the proportion of cases in which such a result will occur is given by
the product of the a’s and the b’s; so that in this case the proportion is
$a^4 b$ (that is, in .06561 of all cases, if a = .9, b = .1). The α’s do not have
to be included in the product, since they occur in all cases after a b has
occurred at the given point; the value of each α as a factor would be 1,
thus not altering the result.
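
The rule for the proportion of any one arrangement may be set down as a short routine. This is a sketch of our own, in which joints are numbered from 1, and each break contributes the factor b, each independently determined no-break the factor a, and each no-break forced by a foregoing break the factor 1.

```python
def pattern_probability(n, k, break_joints, b):
    """Proportion of cases showing breaks at exactly the given joints
    (numbered 1..n), the joints being determined in serial order and a
    break fixing the next k - 1 joints as no-break."""
    a = 1.0 - b
    break_joints = set(break_joints)
    probability = 1.0
    forced_until = 0                 # joints <= forced_until are forced
    for joint in range(1, n + 1):
        if joint <= forced_until:
            if joint in break_joints:
                return 0.0           # a break cannot fall on a forced joint
            continue                 # forced no-break: factor 1
        if joint in break_joints:
            probability *= b
            forced_until = joint + k - 1
        else:
            probability *= a
    return probability


example = pattern_probability(n=8, k=4, break_joints=[3], b=0.1)
print(f"a a b α α α a a : {example:.5f}")
```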

Now, the problem which we have to solve is this: in what proportion of
the cases would no b, 1 b, 2 b’s, etc., occur in n joints, if a b, on its occurrence,
is inevitably followed by (k-1) α’s (no-break)? For convenience, let us
represent, as before, the proportions having no breaks, 1 break, etc., by
$B_0$, $B_1$, $B_2$; or in general, let $B_d$ represent the proportion of the chromosomes
showing (within the stretch of n joints) any required number of
breaks d.

We must distinguish two cases for the occurrence of the b’s followed
by α’s:

(1) If a break b occurs at a sufficient distance from the (latest determined)
end of the chromosome, it is followed by (k-1) α’s. Thus there
occur blocks of k joints, in all of which joints the fate is determined by the
occurrence of b. For example, if k is 4, these blocks are each b α α α. The
chromosome would show such combinations as:

b α α α a a a a a   or   a a a a b α α α a   or   a a a a a b α α α   or   b α α α a b α α α, etc.

(2) But if a break b occurs very near the (latest determined) end of the
chromosome, it cannot be followed by (k-1) α’s, but by some lesser
number (from 0 to k-2). We shall then (if k = 4) have such combinations
as:

a a a a a a a a b   or   a a a a a a b α α   or   b α α α a a a b α α, and the like.

We will distinguish the former case as that in which all the k-blocks
are complete, while in the second case some of the k-blocks are incomplete.
These two cases must be dealt with separately.

(1) All k-blocks complete. If the number of joints is large in proportion
to k (the distance over which interference occurs), this first group will
contain by far the larger proportion of the crossovers; and this is the situation
that we find in the chromosomes. The proportion of these having
any given number d of breaks is determined as follows. There will be
d complete k-blocks, such as b α α α. These will occupy of course dk of
the joints, and there will of course be left n - dk joints to be occupied by
the independently determined a’s, as in the following 13 joints:

a b α α α a a b α α α a a

In such a set of n joints, there are obviously d b’s and (n - dk) independent
a’s, so that the frequency of each is $a^{n-dk} b^d$. But the k-blocks
can occupy different positions, and the frequency of d complete
k-blocks will be given by determining the number of diverse positions in
which d such blocks may occur.

We may look upon each k-block as a single unit. Our problem is reduced
to general terms by considering these k-blocks as units of one
kind; the a’s as units of another kind. Since there are d of the k-blocks,
and (n - dk) of the a’s, the total units are n - dk + d. Our problem then is:
How many different arrangements of n - dk + d things can be made, if
n - dk are of one kind, d of another kind? The number is of course

$$\frac{(n-dk+d)!}{(n-dk)!\; d!}$$

(see text-books of probability; e.g., Carpenter’s “Choice and chance”).

As each occurs with the frequency $a^{n-dk} b^d$, the total proportion of
cases with d complete k-sets is

$$\frac{(n-dk+d)!}{(n-dk)!\; d!}\; a^{n-dk}\, b^d$$

(2) Some of the k-blocks not complete (since a b occurs within the distance
(k-1) of the termination). It is obvious that there will be k-1 possible
cases in which a b occurs within the distance k-1 of the end; these will be
followed by α’s in numbers ranging from 0 to k-2. To determine the
proportions of these in which there are d breaks, we may deal first with the
set in which the final b is just k-1 joints from the end, so as to be followed
by (k-2) α’s. Then of the n joints, (d-1)k will be taken up by complete
k-blocks, while k-1 are occupied by the terminal incomplete block,
making dk-1 that are occupied by the k-blocks. Thus there are left
(n-dk+1) independently determined a’s. The incomplete k-block is
fixed at the end of the chromosome, so that we have (d-1) blocks of k
each, that can be shifted about among the (n-dk+1) a’s. The total
number of units to be considered is therefore again n-dk+d, and we require
the number of diverse arrangements that can be made of these if
n-dk+1 are of one kind; d-1 of another. This (with the total frequency,
in view of the number of a’s) is:

$$\frac{(n-dk+d)!}{(n-dk+1)!\; (d-1)!}\; a^{n-dk+1}\, b^d$$

Similar reasoning applies to the remaining ones of the (k-1) sets in
which b is less than k joints from the end. If we work these out, and add
them to those already obtained, we obtain the complete formula for the
frequency of any specified number of breaks, $B_d$, in case of interference,
with the order of determination serial. This formula, fundamental for
crossing over with interference, is given below:

Let n = the total number of joints dealt with.
k = the number of joints in one direction over which interference
extends, including in k the joint at which the break occurs.
b = the frequency of a break at any one joint, when the conditions for
interference are not present.
a = the frequency of no-break at any one joint when the conditions
for interference are not present (so that a = 1 - b).
d = the number of breaks of which the relative frequency is required.
$B_d$ = the frequency of just d breaks, in case there is interference extending
over k joints.

Then the value of $B_d$ is found to be made up of $b^d$ times the sum of
k terms, as follows:

$$
\begin{aligned}
B_d = b^d \Big[\; & \frac{(n-(d-1)k+d-2)!}{(n-(d-1)k-1)!\,(d-1)!}\; a^{\,n-(d-1)k-1}
 + \frac{(n-(d-1)k+d-3)!}{(n-(d-1)k-2)!\,(d-1)!}\; a^{\,n-(d-1)k-2} \\
& + \frac{(n-(d-1)k+d-4)!}{(n-(d-1)k-3)!\,(d-1)!}\; a^{\,n-(d-1)k-3} + \cdots
 \quad \text{to } (k-1) \text{ terms, the } (k-1)\text{th term being:} \\
& + \frac{(n-dk+d)!}{(n-dk+1)!\,(d-1)!}\; a^{\,n-dk+1} \quad \text{then} \\
& + \frac{(n-dk+d)!}{(n-dk)!\; d!}\; a^{\,n-dk} \;\Big] \qquad (26)
\end{aligned}
$$

It will be observed that, of the terms within the brackets, for the first
(k-1) terms the numerator (factorial) of the fraction and the first term
of the denominator (factorial) are decreased, from the first term on, successively
by 1, while the exponent of a is likewise diminished throughout
successively by 1. In case n is small in proportion to k, not all the terms
will be realized; as soon as a term is reached in which the numerator or
first term in the denominator becomes a minus quantity, that and all
succeeding terms are to be omitted. It is important to remember, however,
that when the numerator or a term of the denominator becomes 0!,
this has the value of 1 (since in general 0! = 1).

The last term of the formula represents the frequency of the complete
k-blocks; the other terms the frequencies of the incomplete k-blocks at
the terminus of the chromosome (or of the part of the chromosome
considered).

It may be observed that this formula (26) is an entirely general formula
for the successive terms when in a series of things the occurrence of a
certain chance result for one unit determines also its results for a number
of the contiguous succeeding units, the order of determination being
serial. If we let k = 1, so that no other units are determined, we have the
case of no interference, and formula (26) yields the usual binomial expression
for the expansion of $(a+b)^n$.
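
Formula (26) may be checked against a direct enumeration of the serial process. The sketch below is ours: `breaks_formula_26` sets down the k terms as described in the text (terms involving a negative factorial being omitted), while `breaks_by_enumeration` simply follows every possible course of determination from the first joint to the last. The values n = 10, k = 3, b = .1 are hypothetical, chosen only to keep the enumeration small.

```python
import math


def breaks_formula_26(n, k, b, d):
    """B_d by formula (26): b^d times the sum of up to k terms."""
    a = 1.0 - b
    total = 0.0
    if d >= 1:
        # Incomplete terminal block: final b followed by 0 .. k-2 forced joints.
        for followers in range(k - 1):
            free = n - (d - 1) * k - followers - 1   # independent a's
            if free < 0:
                break                # this and all succeeding terms omitted
            total += math.comb(free + d - 1, d - 1) * a ** free
    free = n - d * k                 # all d blocks complete
    if free >= 0:
        total += math.comb(free + d, d) * a ** free
    return b ** d * total


def breaks_by_enumeration(n, k, b):
    """Proportions with 0..n breaks, following each joint in serial order."""
    a = 1.0 - b
    proportions = [0.0] * (n + 1)

    def walk(joint, breaks, probability):
        if joint >= n:
            proportions[breaks] += probability
            return
        walk(joint + 1, breaks, probability * a)       # no break here
        walk(joint + k, breaks + 1, probability * b)   # break; k joints fixed

    walk(0, 0, 1.0)
    return proportions


n, k, b = 10, 3, 0.1
enumerated = breaks_by_enumeration(n, k, b)
for d in range(5):
    print(f"B_{d}: formula {breaks_formula_26(n, k, b, d):.6f}"
          f"   enumeration {enumerated[d]:.6f}")
```

With k = 1 the first loop contributes nothing and the remaining term is the ordinary binomial term, in agreement with the remark above.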

For $B_0$ and $B_1$ (the proportion of cases having no break, and 1
break, respectively), formula (26) simplifies directly into the following
expressions:

$$B_0 = a^n \qquad (27)$$

$$B_1 = a^{n-k}\, b\, (1 + a + a^2 + \cdots + a^{k-1}) + (n-k)\, a^{n-k}\, b \qquad (28)$$

Since b = 1 - a, the first member of this expression for $B_1$ is equal to
$a^{n-k}(1 - a^k)$ (see text-books of algebra, e.g., HAWKES’s Advanced
algebra (1905), page 22). Substituting this we have for $B_1$ the more convenient
expression:

$$B_1 = a^{n-k}(1 - a^k) + (n-k)\, a^{n-k}\, b \qquad (29)$$
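
That the two expressions for the proportion with a single break agree may be verified numerically. In this sketch (ours) the function names and the values of n, k and b are chosen only for illustration.

```python
def b1_series(n, k, b):
    """Formula (28): the geometric series written out term by term."""
    a = 1.0 - b
    series = sum(a ** i for i in range(k))       # 1 + a + ... + a^(k-1)
    return a ** (n - k) * b * series + (n - k) * a ** (n - k) * b


def b1_closed(n, k, b):
    """Formula (29): the series replaced by (1 - a^k) / b."""
    a = 1.0 - b
    return a ** (n - k) * (1.0 - a ** k) + (n - k) * a ** (n - k) * b


n, k, b = 60, 20, 0.005      # hypothetical values
print(f"formula (28): {b1_series(n, k, b):.8f}")
print(f"formula (29): {b1_closed(n, k, b):.8f}")
```

When k is made equal to n the second member vanishes and the expression reduces to $1 - a^n$, the value found for complete interference in formula (21).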


Formulae (26) to (29) give the proportions for the different possible
numbers of breaks between two genes, in terms of n, k and b. If these
three values are known, the crossover ratio C can be discovered by their
aid, since it will be the sum of all the cases in which there are odd numbers
of breaks between the two genes separated by n joints; that is, it
will be the sum of $B_1$, $B_3$, $B_5$, etc., up to the term for n/k breaks (plus 1
for any remainder above an integral quotient).
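
The summation of the odd terms may be sketched as follows. To keep the example self-contained, the proportions $B_d$ are here obtained not from formula (26) but by carrying the serial determination forward joint by joint; this is our own device, and the names and numerical values are illustrative assumptions.

```python
def break_distribution(n, k, b):
    """Proportions B_0, B_1, ... for n joints determined in serial order,
    a break fixing the next k - 1 joints as no-break."""
    a = 1.0 - b
    max_breaks = -(-n // k)          # n/k, plus 1 for any remainder
    # reach[pos][d]: proportion in which determination has arrived at
    # joint `pos` (counting from 0) with d breaks so far.
    reach = [[0.0] * (max_breaks + 1) for _ in range(n + k)]
    reach[0][0] = 1.0
    distribution = [0.0] * (max_breaks + 1)
    for pos in range(n + k):
        for d in range(max_breaks + 1):
            p = reach[pos][d]
            if p == 0.0:
                continue
            if pos >= n:                          # whole stretch settled
                distribution[d] += p
            else:
                reach[pos + 1][d] += p * a        # no break at this joint
                reach[pos + k][d + 1] += p * b    # break; k joints fixed
    return distribution


def crossover_ratio(n, k, b):
    """C = B_1 + B_3 + B_5 + ... (odd numbers of breaks)."""
    return sum(break_distribution(n, k, b)[1::2])


print(f"no interference (k = 1):   C = {crossover_ratio(20, 1, 0.05):.6f}")
print(f"partial (k = 5):           C = {crossover_ratio(20, 5, 0.05):.6f}")
print(f"complete (k = n = 20):     C = {crossover_ratio(20, 20, 0.05):.6f}")
```

For k = 1 the sum of the odd binomial terms equals $(1 - (1-2b)^n)/2$, and for k = n it equals $1 - a^n$, which affords two checks on the routine.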

But for the conditions in inheritance we know neither n, k nor b, so that
it is desirable to obtain formulae for $B_0$, $B_1$, $B_2$, etc., and for the crossover
ratio C, that shall be independent of the precise values of n, k and b,
depending merely upon certain general limits to be assigned them, as we
did for the case of no interference.

In this case as in that of no interference, we know that the frequency 
of a break at a single joint is less than .005. 

With regard to k, we know, not the actual number of joints over which 
interference extends, but the extent of the chromosome, measured in 
percentage of crossing over, over which it extends. According to BRIDGES 
(1921), in chromosome I of Drosophila, interference is practically complete 
over a distance which itself yields about 15 percent of crossing over; in 
chromosome II near the ends this distance gives about 20 percent of 
crossing over, while in the middle region it is about 10 percent. 

For our purposes, consider the case that two genes are separated by 
just k joints. In this case there can be but 1 break between them. The 
frequency of no break whatever between them will of course be $a^k$ (since 
n=k), so that the frequency of a single break will be $1-a^k$ (since all 
cases that do not show no-break must show a single break). 

Now, suppose that we know the percentage of crossing over yielded by 
the distance over which interference occurs (as we approximately do in 
Drosophila). This percentage of crossing over is due to a single break 
(since only 1 break can occur in that region of interference). It is therefore 
equal to $1-a^k$. Let this percentage be called P, and for convenience let 
$1-P$ be called A. Since $P=1-a^k$, A or $1-P=a^k$. Thus, if P be .20 
(as in the second chromosome of Drosophila), then A=.80, and this we 
know to be equal to $a^k$. 

Now, if $A=a^k$, of course $a=\sqrt[k]{A}$, and since $b=1-a$, $b=1-\sqrt[k]{A}$. 
We now have the values of a and b transmuted into terms, one of which 
at least (A) is approximately known, experimentally. If now we can in 
some way replace k and n by known quantities, we shall be on the way 
to a solution of our problem. 

Consider the case that n, although unknown, is some definite number 
of times k. Thus, if k be 20 units, each of which gives 1 percent crossing 
over, we may readily deal with a distance of 25 or 30 or 40 or 100 of such 
units; that is, distances in which n=1.25k, 1.5k, 2k or 5k. This is of 
course done in practice in Drosophila. In general, let n=mk. 

Now substitute these values in formula (26). That is, for a substitute 
$\sqrt[k]{A}$, or $A^{1/k}$; for b, substitute $1-\sqrt[k]{A}$; for n, substitute mk. 

We may first undertake this for the values of $B_0$ and $B_1$, in formulae 
(27) and (29). 

From formula (27), we obtain 

$B_0=(\sqrt[k]{A})^{mk}$, which of course yields directly 

$$B_0 = A^m \qquad (30)$$

Making the proposed substitutions in formula (29), for $B_1$, we obtain 
directly (with a slight change in the arrangement of the factors of the 
second member) 

$$B_1 = A^{m-1}(1-A) + (m-1)\,A^{m-1}\cdot k\,(1-\sqrt[k]{A})$$

In the first member of this expression, $1-A$ is of course P, the extent 
of interference. In the second member we find the expression 
$k(1-\sqrt[k]{A})$, and this we recognize again (as on page 415) to approach, as k 
becomes large, the limit given by the minus natural logarithm of A. 
Substituting these, we obtain the following formula (31) for $B_1$, depending 
entirely on the observationally knowable quantities A, P and m: 

$$B_1 = A^{m-1}\cdot P + (m-1)\,A^{m-1}\cdot(-\text{nat log } A) \qquad (31)$$

For example, suppose that interference extends over a distance that 
yields 25 percent crossing over; what is the crossover ratio between two 
genes lying twice this distance apart? Here the total crossover ratio will 
be due to the value of $B_1$. We have P=.25, A=.75, m=2. Hence: 
$C=B_1=.75 \times .25+.75 \times (-\text{nat log } .75)$ = 40.33 percent. 
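The worked example can be reproduced in a few lines. This is a minimal sketch of formula (31) as stated in the text; the function name is illustrative, and the formula is applied only for m not less than 1.

```python
from math import log

def b1_interference(P, m):
    """Formula (31): proportion with a single break between two genes that lie
    m interference-distances apart, where P is the crossover percentage (as a
    fraction) yielded by the interference distance and A = 1 - P."""
    A = 1.0 - P
    return A ** (m - 1) * P + (m - 1) * A ** (m - 1) * (-log(A))

c = b1_interference(0.25, 2)
print(f"{100 * c:.2f} percent")  # prints 40.33 percent
```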

In the case of the general formula (26), for the value of $B_d$ (that is, for 
the proportion giving any designated number of breaks), the conditions 
are of course more complex. It will be observed that the expression within 
the parenthesis in formula (26) consists of k terms, each containing a 
coefficient composed of factorials, and a certain power of a, so that the 
form of each term is $\frac{p!}{(p-q)!\,q!}\,a^r$. Now, such a factorial expression is a definite function of 
p and q, which can be written in the simpler form, more convenient for our 
purposes: $\frac{p^{|q}}{q!}\,a^r$, in which $p^{|q}$ signifies q factors, beginning with p, and descending 
by successive differences of 1. 

We may then write the terms of formula (26) in this manner, at the same 
time making the substitutions proposed. For reasons that will appear 
later, we also multiply the numerator and denominator by $k^d$; this of 
course does not alter its value. We thus obtain: 

$$B_d = \frac{[k(1-\sqrt[k]{A})]^d}{k^d}\left[\frac{((m-d+1)k+d-2)^{|(d-1)}}{(d-1)!}A^{m-d+1-1/k} + \frac{((m-d+1)k+d-3)^{|(d-1)}}{(d-1)!}A^{m-d+1-2/k} + \text{etc. to } k-1 \text{ terms; then } + \frac{((m-d)k+d)^{|d}}{d!}A^{m-d}\right] \qquad (32)$$

This formula can be simplified for practical working purposes, and the 
unknown quantity k gotten rid of, without sensibly altering the numerical 
results which it yields, by the following series of considerations, and 
changes based on them: 

(1) The numerator of the first term of the expression for $B_d$ is the dth 
power of $k(1-\sqrt[k]{A})$. In this latter again we find an expression the limit 
of whose value, as k is increased, is the minus natural logarithm of A (see 
page 415). We know that k is a large number; if b is taken as .0025 
(which is beyond doubt an extreme upper limit for it), to yield 20 percent 
interference k would have to be 89+, while if b is less than this, k would 
be still greater. We may therefore replace the numerator of the first 
member of the expression (32) for $B_d$ by the dth power of the minus 
natural logarithm of A. 

The quantities within the large brackets that enclose the second 
member of the expression for $B_d$ (formula (32)) are a series of k terms, of 
powers of A, the coefficients and exponents of A changing according to 
definite laws. The exponents and coefficients must be dealt with 
separately. 

(2) For the first k-1 terms, the exponents of A form a series decreasing 
successively by a difference of the very small quantity $1/k$, and the total 
difference between the first and last term of these k-1 terms is less than 1. (The 
first exponent is $m-d+1-\frac{1}{k}$; the last is $m-d+1-\frac{k-1}{k}$.) As the difference 
between the extremes of this long series of slightly differing powers of A 
is so slight, it will make little difference in the total value of the expression 
if we substitute for each exponent the mean of all, which is $\frac{2m-2d+1}{2}$. 
(Whether this substitution does make an appreciable difference will of 
course be tested later.) 

(3) We have thus brought all the terms within the brackets except 
the last one to a common power of A. If now we can add together their 
coefficients, we shall obtain a single expression for the total value of these 
k-1 terms. The coefficients are, for successive decreasing numbers, the 
functions represented by the form $\frac{p^{|q}}{q!}$. The first coefficient of formula 
(32) is $\frac{((m-d+1)k+d-2)^{|(d-1)}}{(d-1)!}$, while for the (k-1)th term it is 
$\frac{((m-d)k+d)^{|(d-1)}}{(d-1)!}$. Now, the sum of a series of successive terms of the form 
$\frac{p^{|q}}{q!}$, beginning with a value of 1 (in which case p and q are equal), so that 
the series runs $\frac{q^{|q}}{q!}+\frac{(q+1)^{|q}}{q!}+\frac{(q+2)^{|q}}{q!}$ up to $\frac{p^{|q}}{q!}$, is $\frac{(p+1)^{|(q+1)}}{(q+1)!}$. 
Replacing p and q by their values in the coefficients of formula (32), we 
find that the total sum of all the coefficients, from the value 1, up to the 
highest value given in the first term, is $\frac{((m-d+1)k+d-1)^{|d}}{d!}$. 

But the lowest of these (k-1) terms does not have the value 1, but 
instead the value $\frac{((m-d)k+d)^{|(d-1)}}{(d-1)!}$. Hence, from the total sum, just 
given, must be subtracted that part of the sum given by the series from 
the value 1 up to $\frac{((m-d)k+d-1)^{|(d-1)}}{(d-1)!}$. This part will, according to the 
rule just given, be $\frac{((m-d)k+d)^{|d}}{d!}$. The actual sum of the (k-1) 
coefficients of $A^{\frac{2m-2d+1}{2}}$ is therefore: 
$\frac{((m-d+1)k+d-1)^{|d}}{d!}-\frac{((m-d)k+d)^{|d}}{d!}$; so that the total value of the first 
(k-1) terms within the large brackets is 

$$\left(\frac{((m-d+1)k+d-1)^{|d}}{d!}-\frac{((m-d)k+d)^{|d}}{d!}\right)A^{\frac{2m-2d+1}{2}}$$

There remains the kth or last term of the expression in the brackets of 
(32): this is $\frac{((m-d)k+d)^{|d}}{d!}A^{m-d}$. We add this, and at the same time 
we divide each term within the large brackets by $k^d$, thus removing $k^d$ 
from the quantity outside the brackets. This gives us as the present 
condition of the formula for $B_d$ the following: 

$$B_d = (-\text{nat log } A)^d\left[\left(\frac{((m-d+1)k+d-1)^{|d}}{d!\,k^d}-\frac{((m-d)k+d)^{|d}}{d!\,k^d}\right)A^{\frac{2m-2d+1}{2}} + \frac{((m-d)k+d)^{|d}}{d!\,k^d}A^{m-d}\right]$$
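Two numerical facts used in considerations (1) and (3) above can be checked directly: that $k(1-\sqrt[k]{A})$ is already close to the minus natural logarithm of A when k is near 89, and that terms of the form $p^{|q}/q!$ (which are binomial coefficients) sum by the rule quoted. The sketch below is illustrative only; the helper names are not the author's.

```python
from math import comb, log

A = 0.80                     # 1 - P for interference yielding 20 percent
b = 0.0025                   # the text's extreme upper limit for b
k = log(A) / log(1.0 - b)    # joints spanned by the interference distance
print(f"k = {k:.2f}")        # a little above 89, the text's "89+"

# Consideration (1): k(1 - A^(1/k)) approaches -nat log A as k grows.
for kk in (10, 89, 1000):
    print(kk, kk * (1.0 - A ** (1.0 / kk)), -log(A))

def coefficient(p, q):
    """p(p-1)...(p-q+1)/q!, that is, the text's p^{|q}/q!."""
    return comb(p, q)

def summed_coefficients(p, q):
    """Sum of the coefficients from the value 1 (p = q) up to p^{|q}/q!."""
    return sum(coefficient(j, q) for j in range(q, p + 1))

# Consideration (3): the sum equals (p+1)^{|(q+1)}/(q+1)!.
print(summed_coefficients(10, 3), coefficient(11, 4))
```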

From this we can obtain an approximate formula that shall not include 
the unknown quantity k, and that will give results differing from the 
complete formula by only a negligible amount (as will be shown). This 
results from the following considerations: 

The first coefficient $\frac{((m-d+1)k+d-1)^{|d}}{d!\,k^d}$ as k becomes larger approaches 
$\frac{(m-d+1)^d}{d!}$ as a limit. Thus, for the values m=4 and d=2 the value of 
this limiting expression $\frac{(m-d+1)^d}{d!}$ is 4.5. If now we make k=40, the 
value of $\frac{((m-d+1)k+d-1)^{|d}}{d!\,k^d}$ is 4.5375; if k is 100, its value is 4.515. 
Similarly the second and third coefficients within the parenthesis each 
approach $\frac{(m-d)^d}{d!}$ as k becomes large. We may therefore substitute 
these values for the three coefficients. We then obtain as our final 
approximation formula for $B_d$ the following, which is to be employed in 
computing the crossover ratio, C, in case there is interference: 

$$B_d = (-\text{nat log } A)^d\left[\left(\frac{(m-d+1)^d}{d!}-\frac{(m-d)^d}{d!}\right)A^{\frac{2m-2d+1}{2}} + \frac{(m-d)^d}{d!}A^{m-d}\right] \qquad (33)$$
In formula (33): 
d=the number of breaks between the two genes in question. 

$B_d$=the proportion of cases having any number d of breaks. 

A=1-P, where P is the average number of standard units of length 
(each unit yielding 1 percent of crossing over), over which 
interference extends. 

m=the distance between the two genes for which the crossover ratio 
is sought, in units, each of which is P. Thus, if the distance 
between the two genes is 1.25 times the mean distance over which 
interference extends, m=1.25. 

C, the crossover ratio, is obtained from this formula (33) by taking 
the sum of the values of $B_d$ for odd values of d. As it turns out, 
usually, the only odd values of d that have an appreciable value 
are 1 and 3, so that as a rule $C=B_1+B_3$. The value of $B_1$ is most 
readily obtained from formula (31); that of $B_3$ from (33). 

It is to be noted that there may be d breaks, and so a real value for $B_d$, 
only if m is greater than d-1. Thus if the chromosome has length just 
twice that over which interference extends, it is not possible that there 
should be three breaks within that distance. But if it extends beyond 
twice the interference distance, then there can be three breaks. If now m 
is greater than d-1, but less than d, a number of terms in the above 
expression (33) disappear, and the expression for $B_d$ becomes simplified. 
It is perhaps unnecessary to give in detail the steps of that simplification; 
it will be found that in this case the formula takes the form given in the 
following (34); that is: 

When the value of m lies between d and d-1, 

$$B_d = (-\text{nat log } A)^d\left(\frac{(m-d+1)^d}{d!}A^{\frac{m-d+1}{2}}\right) \qquad (34)$$

The justification for making the above approximations of course 
requires test. Such test was fully supplied in the course of this 
investigation, through the fact that, as may be supposed, these changes, 
approximations and simplifications, did not suggest themselves at first, 
but came after much labor and many divagations. At first it appeared 
necessary to discover the effects of interference by employing the 
fundamental formula for $B_d$, in terms of n, k and b, given in formula (26). 
A maximum value was assigned to b, using for this .005, since as before 
remarked, b is certainly less than this. Then if we assume a certain 
extent of interference P, we can determine the corresponding value of k 
through the relation $P=1-a^k$, set forth on page 433; from this it follows 
that 

$$k=\frac{\log(1-P)}{\log(1-b)}, \text{ or } k=\frac{\log A}{\log a}$$

When b is put at .005, so that a=.995, we find that for interference 
extending over a distance yielding 20 percent crossing over, k=45; for 
25 percent, k=58; for 30 percent, k=72, and so on. Taking now values of 
n that are a certain number of times k, we work out the crossover ratios 
by adding the values of the odd numbers of breaks, $B_1+B_3+B_5$, etc. The 
value of $B_1$ is obtained from formula (29); those for $B_3$, $B_5$, etc., are 
gotten with great labor from formula (26). It is found that the results 
do not differ sensibly from those given for the same extent of interference 
and same proportion of n to k by our simple general approximation 
formula (33). The following are typical comparative results of the two 
methods, for certain values of m, when the interference extends over a 
distance yielding a crossover ratio of 25 percent. 





TABLE 7 

Crossover ratio, percent. 

m   | COMPLETE FORMULA (26), ASSUMING b=.005 | APPROXIMATION FORMULA (33) 
1   | 25.00 | 25.00 
1.5 | 34.08 | 34.11 
2   | 40.27 | 40.32 
2.5 | 44.22 | 44.26 
3   | 46.69 | 46.77 
4   | 49.02 | 49.06 
5   | 49.49 | 49.78 











As table 7 shows, the approximation formula (33) is quite accurate 
enough for all purposes, even for so high a value of b as .005; for the 
actually smaller values it is of course still more accurate. It even gives 
results accurate to the third decimal place for values of $B_d$ when there is 
no interference. In this case the value of k is 1, and m=n, while P=b 
and A=a. Letting b=.005, we find by (33) that $B_3$=.012007, while by 
the simple binomial formula it is .012430. 
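The approximation column of table 7 can be recomputed from the approximate formulae. The sketch below takes $B_1$ from formula (31) and the higher odd terms from formula (33), and assumes that, when m lies between d-1 and d, formula (34) has the form $(-\text{nat log } A)^d \frac{(m-d+1)^d}{d!} A^{(m-d+1)/2}$. That form is an assumption drawn from the description in the text, and the function names are illustrative.

```python
from math import factorial, log

def b_d_approx(A, m, d):
    """Approximate proportion with d breaks (d odd), for m >= 1:
    formula (31) for d = 1, formula (33) for m >= d,
    and the assumed form of formula (34) for d-1 < m < d."""
    L = -log(A)
    if d == 1:
        return A ** (m - 1) * (1.0 - A) + (m - 1) * A ** (m - 1) * L
    if m <= d - 1:
        return 0.0  # d breaks cannot occur in so short a stretch
    if m < d:
        return L ** d * (m - d + 1) ** d / factorial(d) * A ** ((m - d + 1) / 2)
    upper = (m - d + 1) ** d / factorial(d)
    lower = (m - d) ** d / factorial(d)
    return L ** d * ((upper - lower) * A ** ((2 * m - 2 * d + 1) / 2)
                     + lower * A ** (m - d))

def crossover_ratio(P, m):
    """C as the sum of B_d over the odd values of d that are possible."""
    A = 1.0 - P
    total, d = 0.0, 1
    while d - 1 < m:
        total += b_d_approx(A, m, d)
        d += 2
    return total

for m in (1, 1.5, 2, 2.5, 3, 4, 5):
    print(m, round(100 * crossover_ratio(0.25, m), 2))
```

Under these assumptions the printed values fall within a tenth of a percentage point of the approximation column of table 7.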

Throughout the remainder of this paper, therefore, we employ the 
results yielded by formula (33) and its modifications. Thus, the crossover 
ratio C in case of interference is: 

$$C = B_1 + B_3 + B_5, \text{ etc.} \qquad (35)$$

The values of $B_1$, $B_3$, $B_5$, etc. are of course found by (31), (33) and (34). 


The general relations in crossing over with interference 


In formula (33) with its modifications, formulae (30), (31) and (34), we 
have an instrument by which we can determine the crossover ratios for 
any degree of interference, with the rules and general relations of crossing 
over when there is interference, discovering whether interference alters 
any of the general relations found to hold for linear crossing over without 
interference; how far there is agreement with our 17 basic propositions; 
and how far the conditions resulting agree with those found in nature. 

1. We may first examine a table of crossover ratios between genes ten 
units apart, with a degree of interference comparable to that held to be 
found in Drosophila. Selecting for this purpose interference extending 
over a distance that itself yields 20 percent crossing over, we obtain table 
8. 


TABLE 8 

Crossover ratios for genes A to L, if they are separated by successive stretches each 10 units in length, 
and there is interference extending for 20 units (worked out from formula (33)). 
For comparison with the results when there is no interference (table 5), and with the conditions in 
nature (table 2). 

  |  A   |  B   |  C   |  D   |  E   |  F   |  G   |  H   |  I   |  J   |  K   |  L 
A | .... | 10.0 | 20.0 | 27.9 | 33.9 | 38.3 | 41.5 | 43.8 | 45.6 | 46.8 | 47.7 | 48.4 
B | 10.0 | .... | 10.0 | 20.0 | 27.9 | 33.9 | 38.3 | 41.5 | 43.8 | 45.6 | 46.8 | 47.7 
C | 20.0 | 10.0 | .... | 10.0 | 20.0 | 27.9 | 33.9 | 38.3 | 41.5 | 43.8 | 45.6 | 46.8 
D | 27.9 | 20.0 | 10.0 | .... | 10.0 | 20.0 | 27.9 | 33.9 | 38.3 | 41.5 | 43.8 | 45.6 
E | 33.9 | 27.9 | 20.0 | 10.0 | .... | 10.0 | 20.0 | 27.9 | 33.9 | 38.3 | 41.5 | 43.8 
F | 38.3 | 33.9 | 27.9 | 20.0 | 10.0 | .... | 10.0 | 20.0 | 27.9 | 33.9 | 38.3 | 41.5 
G | 41.5 | 38.3 | 33.9 | 27.9 | 20.0 | 10.0 | .... | 10.0 | 20.0 | 27.9 | 33.9 | 38.3 
H | 43.8 | 41.5 | 38.3 | 33.9 | 27.9 | 20.0 | 10.0 | .... | 10.0 | 20.0 | 27.9 | 33.9 
I | 45.6 | 43.8 | 41.5 | 38.3 | 33.9 | 27.9 | 20.0 | 10.0 | .... | 10.0 | 20.0 | 27.9 
J | 46.8 | 45.6 | 43.8 | 41.5 | 38.3 | 33.9 | 27.9 | 20.0 | 10.0 | .... | 10.0 | 20.0 
K | 47.7 | 46.8 | 45.6 | 43.8 | 41.5 | 38.3 | 33.9 | 27.9 | 20.0 | 10.0 | .... | 10.0 
L | 48.4 | 47.7 | 46.8 | 45.6 | 43.8 | 41.5 | 38.3 | 33.9 | 27.9 | 20.0 | 10.0 | .... 

A study of table 8 in connection with the fundamental assumptions as 
to the serial order of the genes, shows that interference extending for 20 
units does not alter any of the general relations already found to hold for 
crossing over without interference, except of course our basic proposition 
17, which it satisfies. That is, linear crossing over with interference of 
about 20 units requires and accounts for all of the general relations set 
forth in our 17 basic propositions of pages 397-399. To this very remark- 
able fact we return later. 

2. For full understanding of the relations just mentioned and of those 
to follow, it is necessary to note the effect of interference on the actual 
length of the standard "units" in which the chromosome is measured, 
and on certain other quantities. A unit is the length (or number of joints) 
necessary to give one percent of crossing over. Assuming that before the 
conditions of interference arise, any given joint shows a break on the 
average in a certain proportion b of cases, then with the same value of b, 
as interference increases, the length of the unit increases. 

With a given value of b, if k is the number of joints over which 
interference extends, then for genes separated by just k joints the crossover 
ratio C is given by the simple formula $C=1-a^k$ (where $a=1-b$). (This 
is derived directly from formula (29), replacing n by k, and remembering 
that in this case $C=B_1$.) From this again we can of course obtain the 
value of k for any extent of interference P, by replacing C by P in the 
above formula, and solving for k. This yields: 

$$k=\frac{\log(1-P)}{\log(1-b)}$$

The number of joints in a unit is then given by dividing the value of k 
by P (considering the latter as integral percents). An example will make 
this clear. Let the frequency b of a break at any joint be .002, so that 
$1-b=.998$. It is required to know how many joints there are in k if 
interference extends over a distance that yields 20 percent crossing over. 

Here C=P=.20, so that $k=\frac{\log .8}{\log .998}=111.455$. 

If 111.455 joints are required to give 20 percent crossing over, to give 1 
percent 5.573 are required (since within the stretch k the values are simply 
additive), so that a unit is 5.573 joints. 
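The unit-length computation can be sketched as follows. The function names are illustrative; the only relations used are $k=\log(1-P)/\log(1-b)$ and the division of k by P taken in whole percents. Floating-point arithmetic gives a value of k agreeing with the text's 111.455 to about two decimal places; the small difference presumably reflects the logarithm tables of the period.

```python
from math import log

def joints_in_interference(P, b):
    """Number of joints k over which interference extends, given the
    crossover fraction P of that distance and the break frequency b."""
    return log(1.0 - P) / log(1.0 - b)

def joints_per_unit(P, b):
    """Joints needed to give 1 percent of crossing over (one 'unit'),
    taking values within the stretch k as simply additive."""
    return joints_in_interference(P, b) / (100.0 * P)

k = joints_in_interference(0.20, 0.002)
unit = joints_per_unit(0.20, 0.002)
print(round(k, 2), round(unit, 3))
```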

Working out the value of a unit for a number of different extents of 
interference by this method, we obtain table 9. 

3. In case there is interference, it is necessary to distinguish carefully 
between the frequency of a break at any joint before the conditions for 
case of interference would be obtained from the binomial expression 
$(a+b)^n$, by determining what part of each of the terms is to be 
transferred to the terms preceding it. What part of those that without 
interference would show two breaks, will now show but one? And so of the 
remaining terms. 


Complete interference 


We shall deal first with the relatively simple possible case that a single 
break in the chromosome prevents the occurrence of another break in this 
entire chromosome. As before, we let b represent the frequency of a break 
at any particular joint so long as interference has not come into operation, 
while a represents the frequency of no-break at any particular joint, under 
conditions of no interference. The total number of joints in the 
chromosome we designate n; interference extends over the entire n joints. 

When there is no interference, as we have seen, the relative numbers of 
chromosomes showing the different possible numbers of breaks, from 0 
up, are given by the successive terms of the binomial $(a+b)^n$. If, when 
there has occurred a single break, no other occurs, it is obvious that all 
the chromosomes will now show a single break, except those which, in 
case there were no interference, would show no break. The proportion 
of these latter is given by the first term of the binomial $(a+b)^n$; it is of 
course $a^n$. The proportion of the chromosomes showing a single break is 
therefore in the case of complete interference, $1-a^n$; and this of course 
will be the crossover ratio, as well as the value of $B_1$ (the proportion of 
cases having a single break). We therefore have for the case of complete 
interference for the entire length of the chromosome the formula: 

$$C = B_1 = 1-a^n \qquad (21)$$

From this formula it is clear that, if interference is complete throughout 
the length of the chromosome, C may have any value between 0 and 1. 
Crossover ratios above 50 percent may occur as readily as ratios below 
50 percent. For whatever value be given to a (the proportion of cases in 
which a given joint is not broken, so long as interference has not come into 
action), if n be taken sufficiently great $a^n$ will become less than ½, so 
that C will be greater than ½ or 50 percent. Thus, if for b we take the 
super-maximal value of .005, so that a=.995, then for n=138.28, C=½ 
or 50 percent; for n=277, C=75 percent; for n=455, C=90 percent. In 
general the number of joints n, between the two extreme genes, necessary 
to give for them any particular crossover ratio C is given by transforming 
formula (21) as follows: 

$$1-C=a^n$$

Whence 

$$n=\frac{\log(1-C)}{\log a} \qquad (22)$$

In formula (22) of course n is the total number of joints in the 
chromosome, and C the crossover ratio for the two end genes. 
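Formulae (21) and (22) are inverse to one another, which a short sketch makes easy to check against the figures quoted for b=.005. The function names are illustrative and not the author's.

```python
from math import log

def crossover_complete(n, b):
    """Formula (21): crossover ratio between the end genes when one break
    prevents any other break in the whole chromosome of n joints."""
    return 1.0 - (1.0 - b) ** n

def joints_needed(C, b):
    """Formula (22): number of joints n giving the crossover ratio C."""
    return log(1.0 - C) / log(1.0 - b)

print(round(joints_needed(0.50, 0.005), 2))
print(round(crossover_complete(277, 0.005), 3))
```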

In the case of complete interference, if any given joint shows a break 
as frequently as any other, the actual mean frequency of a break (which 
we may call β, to distinguish it from b, the frequency when there is no 
interference) is, of course: 

$$\beta=\frac{1-a^n}{n} \qquad (23)$$

And for genes separated by any number, m, of joints the crossover ratio 
is the frequency of a single break in the total m joints, so that: 

$$C=B_1=\frac{m(1-a^n)}{n} \qquad (24)$$

(Where n is the total number of joints in the chromosome.) 

In this case, for comparing the crossover ratios of two stretches of the 
chromosome, it is evident from (24) that their ratios are directly 
proportional to the number of joints they contain. That is, if $C_1$ is the ratio 
for $m_1$ joints, then for $C_2$, the ratio for $m_2$ joints, we have 

$$\frac{C_2}{C_1}=\frac{m_2}{m_1}, \text{ whence } C_2=\frac{m_2 C_1}{m_1} \qquad (25)$$

And of course the crossover ratio for the sum of two or more contiguous 
stretches of the chromosome is the sum of the ratios for the several 
stretches taken separately. The value of the crossover ratio between any 
two genes would be the same as the number of units (each yielding 1 
percent of crossing over). 

No cases are thus far known of complete interference; the observations 
show that, interpreted on the linear theory, some of the chromosomes 
show more than a single break. We proceed therefore to examine: 


Partial interference 


In partial interference, the occurrence of a break at a given joint 
interferes with another break within a certain distance from that joint, but 
this distance is not so great as to include the entire length of the 
chromosome. Hence, two or more breaks may occur in the chromosome, 
depending on its length. 

The generalized problem of partial interference is complicated by the 
fact that the results differ slightly, depending on the order in which the 
fate of the different joints of the chromosome is determined. This may be 
illustrated by a simple example. Suppose that when a break occurs, this 
prevents the occurrence of another break for three joints in each direction. 
Then if in the 6 joints shown below, there would without interference 
occur three breaks (b), at joints 1, 3 and 5, 

1 2 3 4 5 6 
b   b   b 

as shown, the number of breaks remaining in case there is interference 
will depend upon which of the three breaks is produced first. If number 1 
is produced first, this will interfere with number 3, but not with number 5, 
so that there will remain two breaks. But if number 3 is produced first, 
this will interfere with both the other two breaks (since these are both 
within 3 joints of it), so that there would now remain but a single break. 
The order of determination affects in other ways also the proportions 
showing particular numbers of breaks. 
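The six-joint example can be traced mechanically. In this sketch the rule "prevents another break for three joints in each direction" is read as suppressing any later break whose joint number differs by 3 or less from a break already produced; the function name and the `reach` parameter are illustrative.

```python
def surviving_breaks(candidates, order, reach):
    """Return the breaks that remain when the candidate breaks are produced
    in the given order and each produced break suppresses later ones
    lying within `reach` joints of it in either direction."""
    kept = []
    for joint in order:
        if joint in candidates and all(abs(joint - other) > reach for other in kept):
            kept.append(joint)
    return sorted(kept)

candidates = {1, 3, 5}
print(surviving_breaks(candidates, [1, 3, 5], reach=3))  # → [1, 5]
print(surviving_breaks(candidates, [3, 1, 5], reach=3))  # → [3]
```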

If, however, the linear theory holds for the chromosomes, it is extremely 
probable that the determination of the fate of the joints occurs in serial 
order, from one end of the chromosome to the other. For the cytological 
evidence indicates that when the conjugation of the chromosomes occurs, 
in the very long thread-like stage, the union begins at one end and 
progresses toward the other; and the separation of the two united chromosomes 
appears to take place in a similar manner. The determination of the 
fate of the joints must apparently occur either when the two chromosomes 
are uniting, or when they are separating. 

It appears therefore that a solution of the problem for determination 
of the fate of the joints successively in linear order, is what we require. 
In any case, however, it appears that when the frequency of breaks is 
small, and the number of joints is large, as we know to be the case in the 
chromosome (if the linear theory holds), the difference in the results due 
to different orders of determination would be so minute as to be entirely 
negligible. A number of typical particular diverse orders worked out 
gave results differing only in the fourth or fifth decimal place or beyond.³ 


³ If it be held that the first break occurs as frequently at any one joint as at any other, and 
that any given order of determining the fate of the entire n joints occurs as frequently as any 
other; then there would be n! equally frequent orders of determination. By transforming, in 
accordance with these principles, the binomial $(a+b)^n$ into an expression giving the mean result 
for all these n! orders of determination, in case of interference extending for any given number of 
joints, a solution under these conditions would be obtained. Apparently such a solution would 
not differ in numerical results appreciably from that given in the text. 
We assume then that the determination of whether each joint is to 
show a break or no break occurs in serial order, from one extremity of the 
chromosome to the other. Thus in the series 

1 2 3 4 5 6 7 8 


the fate of number 1 is first determined, then that of number 2, and so on. 
(Determination proceeding serially from the middle in both directions, as 
might occur in V shaped chromosomes, would give essentially identical 
results.) 

We shall, as in the case of no interference, first develop formulae in 
which b, the frequency of a break at any given joint, and n, the number 
of joints, are assumed to be known; from these we shall then obtain general 
formulae showing the relations even when b and n are not known. 

It is convenient to designate as k the number of joints whose fate is 
determined (in one direction, of course, if the order of determination is 
serial), when a break occurs at a given joint, including in k the joint at 
which a break occurs. Thus, if, when a break occurs, this prevents a 
break in the four succeeding joints, the value of k is 5. 

The problem of the results of interference in a series of n joints, in case 
n is greater than the distance of interference, may be attacked directly, in 
the following manner. 

The chromosome has n joints between its n+1 genes. At each joint, so 
long as interference does not come into operation, a break occurs in the 
fraction b of all cases; no-break in the fraction a (=1-b). Whenever the 
fate of a certain joint is determined as no-break, this has no effect on the 
fate of succeeding joints. But when a certain joint receives a break, then 
a certain number (k-1) of the succeeding joints are determined as 
yielding no break, so that at any break a total of k joints is determined. 

For convenience of illustration, we will designate a break by the letter 
b; no-break (when independently determined) by the letter a; these will 
also represent the proportions of cases in which these conditions occur. 
We may further designate a "no-break" which is determined by a 
foregoing break, by the Greek letter α. If then, for example, n=8, k=4, 
and the first break occurs at the third joint, while no other occurs, we 
represent the condition as follows: 

a a b α α α a a 

And the proportion of cases in which such a result will occur is given by 
the product of the a's and the b's; so that in this case the proportion is 
$a^4b$ (that is, in .06561 of all cases, if a=.9, b=.1). The α's do not have 
to be included in the product, since they occur in all cases after a b has 
occurred at the given point; the value of each α as a factor would be 1, 
thus not altering the result. 
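The bookkeeping just described lends itself to direct enumeration when n is small. The sketch below lists every outcome of serial determination for n=8 and k=4, multiplying an a or a b for each independently determined joint and a factor of 1 for each α; the function name is illustrative, and truncating a block at the end of the chromosome is the sketch's way of representing a break that falls near the terminus.

```python
from collections import defaultdict

def serial_outcomes(n, k, b):
    """Yield (pattern, proportion) for every outcome of determining n joints
    in serial order. 'a' = independent no-break, 'b' = break,
    'α' = no-break forced by a foregoing break (factor 1)."""
    a = 1.0 - b

    def extend(pattern, proportion):
        if len(pattern) >= n:
            yield pattern[:n], proportion   # a terminal block may be cut short
            return
        yield from extend(pattern + "a", proportion * a)
        yield from extend(pattern + "b" + "α" * (k - 1), proportion * b)

    yield from extend("", 1.0)

outcomes = dict(serial_outcomes(8, 4, 0.1))
print(outcomes["aabαααaa"])

by_breaks = defaultdict(float)
for pattern, proportion in outcomes.items():
    by_breaks[pattern.count("b")] += proportion
print(dict(by_breaks))
```

The pattern a a b α α α a a comes out at $a^4b$, and the proportions for 0, 1, 2, ... breaks together account for all cases.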

Now, the problem which we have to solve is this: in what proportion of 
the cases would no b, 1 b, 2 b, etc., occur in n joints, if a b, on its occurrence, 
is inevitably followed by (k-1) α's (no-break)? For convenience, let us 
represent, as before, the proportions having no breaks, 1 break, etc., by 
$B_0$, $B_1$, $B_2$; or in general, let $B_d$ represent the proportion of the 
chromosomes showing (within the stretch of n joints) any required number of 
breaks d. 

We must distinguish two cases for the occurrence of the b's followed 
by α's: 

(1) If a break b occurs at a sufficient distance from the (latest 
determined) end of the chromosome, it is followed by (k-1) α's. Thus there 
occur blocks of k joints, in all of which joints the fate is determined by the 
occurrence of b. For example, if k is 4, these blocks are each b α α α. The 
chromosome would show such combinations as: 


baaaadaaaa oraadaabaaaa or aaaaabaaa or baaacbaaa, etc. 


(2) But if a break b occurs very near the (latest determined) end of the 
chromosome, it cannot be followed by (k-1) α's, but by some lesser 
number (from 0 to k-2). We shall then (if k=4) have such combinations 
as: 

aaaaaaaab or aaaaaabaa or baaaaaabda, and the like. 


We will distinguish the former case as that in which all the k-blocks 
are complete, while in the second case some of the k-blocks are incomplete. 
These two cases must be dealt with separately. 

(1) All k-blocks complete. If the number of joints n is large in proportion 
to k (the distance over which interference occurs), this first group will 
contain by far the larger proportion of the crossovers; and this is the 
situation that we find in the chromosomes. The proportion of these having 
any given number d of breaks is determined as follows. There will be 
d complete k-blocks, such as b α α α. These will occupy of course dk of 
the n joints, and there will of course be left n-dk joints to be occupied by 
the independently determined a's, as in the following 13 joints: 

abaaaadabaaaaa 


In such a set of m joints, there are obviously d b’s and (m—dk) inde- 
pendent a’s, so that the frequency of each is a®-}*, But the k- 
blocks can occupy different positions, and the frequency of d complete 
k-blocks will be given by determining the number of diverse positions in 
which d such blocks may occur. 
We may look upon each k-block as a single unit. Our problem is reduced
to general terms by considering these k-blocks as units of one
kind; the a's as units of another kind. Since there are d of the k-blocks,
(n-dk) of the a's, the total units are n-dk+d. Our problem then is:
How many different arrangements of n-dk+d things can be made, if
n-dk are of one kind, d of another kind? The number is of course

$$\frac{(n-dk+d)!}{(n-dk)!\,d!}$$


(see text-books of probability; e.g., WHITWORTH's "Choice and Chance".)

As each occurs with the frequency $a^{n-dk}b^d$, the total proportion of
cases with d complete k-sets is

$$\frac{(n-dk+d)!}{(n-dk)!\,d!}\,a^{n-dk}\,b^d$$
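
The counting argument for complete k-blocks can be checked by direct enumeration. The following is a minimal sketch in Python, not part of the original derivation; the function name is hypothetical and the enumeration is practicable only for small n. It lists every string of n joints and keeps those in which each b is followed by k-1 a's within the string.

```python
from itertools import product
from math import comb


def count_complete_block_arrangements(n, k, d):
    """Count strings of n joints (a = no break, b = break) that contain
    exactly d breaks, every break being followed by k-1 a's inside the
    string, so that all d k-blocks are complete."""
    count = 0
    for joints in product("ab", repeat=n):
        positions = [i for i, c in enumerate(joints) if c == "b"]
        if len(positions) != d:
            continue
        # Each block must fit before the end of the string ...
        fits = all(p + k <= n for p in positions)
        # ... and successive breaks must be at least k joints apart.
        spaced = all(q - p >= k for p, q in zip(positions, positions[1:]))
        if fits and spaced:
            count += 1
    return count


def formula_count(n, k, d):
    """(n-dk+d)! / ((n-dk)! d!), written as a binomial coefficient."""
    return comb(n - d * k + d, d)
```

For the 13 joints of the example above, with k=4 and d=2, both functions give the same number of arrangements.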

(2) Some of the k-blocks not complete (since a b occurs within the distance
(k-1) of the termination). It is obvious that there will be k-1 possible
cases in which a b occurs within the distance k-1 of the end; these will be
followed by a's in numbers ranging from 0 to k-2. To determine the
proportions of these in which there are d breaks, we may deal first with the
set in which the final b is just k-1 joints from the end, so as to be followed
by (k-2) a's. Then of the n joints, (d-1)k will be taken up by complete
k-blocks, while k-1 are occupied by the terminal incomplete block,
making dk-1 that are occupied by the k-blocks. Thus there are left
(n-dk+1) independently determined a's. The incomplete k-block is
fixed at the end of the chromosome, so that we have (d-1) blocks of k
each, that can be shifted about among the (n-dk+1) a's. The total
number of units to be considered is therefore again n-dk+d, and we
require the number of diverse arrangements that can be made of these if
n-dk+1 are of one kind; d-1 of another. This (with the total frequency,
in view of the number of a's) is:

$$\frac{(n-dk+d)!}{(n-dk+1)!\,(d-1)!}\,a^{n-dk+1}\,b^d$$

Similar reasoning applies to the remaining ones of the (k-1) sets in
which b is less than k joints from the end. If we work these out, and add
them to those already obtained, we obtain the complete formula for the
frequency of any specified number of breaks, $B_d$, in case of interference,
with the order of determination serial. This formula, fundamental for
crossing over with interference, is given below:
Let n = the total number of joints dealt with.
k = the number of joints in one direction over which interference
extends, including in k the joint at which the break occurs.
b = the frequency of a break at any one joint, when the conditions for
interference are not present.
a = the frequency of no-break at any one joint when the conditions
for interference are not present (so that a=1-b).
d = the number of breaks of which the relative frequency is required.
$B_d$ = the frequency of just d breaks, in case there is interference
extending over k joints.
Then the value of $B_d$ is found to be made up of $b^d$ times the sum of
k terms, as follows:

$$
\begin{aligned}
B_d = b^d\Bigg[\; & \frac{(n-(d-1)k+d-2)!}{(n-(d-1)k-1)!\,(d-1)!}\,a^{\,n-(d-1)k-1}
 + \frac{(n-(d-1)k+d-3)!}{(n-(d-1)k-2)!\,(d-1)!}\,a^{\,n-(d-1)k-2} \\
& + \frac{(n-(d-1)k+d-4)!}{(n-(d-1)k-3)!\,(d-1)!}\,a^{\,n-(d-1)k-3} + \cdots \\
& + \frac{(n-dk+d)!}{(n-dk+1)!\,(d-1)!}\,a^{\,n-dk+1}
 + \frac{(n-dk+d)!}{(n-dk)!\,d!}\,a^{\,n-dk} \;\Bigg] \qquad (26)
\end{aligned}
$$

The terms before the last run to (k-1) terms, the (k-1)th term being that
with the exponent n-dk+1.


It will be observed that, of the terms within the brackets, for the first
(k-1) terms the numerator (factorial) of the fraction and the first term
of the denominator (factorial) are decreased, from the first term on,
successively by 1, while the exponent of a is likewise diminished throughout
successively by 1. In case n is small in proportion to k, not all the terms
will be realized; as soon as a term is reached in which the numerator or
first term in the denominator becomes a minus quantity, that and all
succeeding terms are to be omitted. It is important to remember, however,
that when the numerator or a term of the denominator becomes 0!,
this has the value of 1 (since in general 0!=1).

The last term of the formula represents the frequency of the complete 
k-blocks; the other terms the frequencies of the incomplete k-blocks at 
the terminus of the chromosome (or of the part of the chromosome 
considered). 
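
Formula (26), with the rule for omitting unrealized terms, can be put in computable form as follows. This is a minimal sketch in Python under stated assumptions: the function name is hypothetical, the case d=0 is handled separately as $a^n$ (the factorial (d-1)! having then no meaning), and each factorial quotient is written as a binomial coefficient.

```python
from math import comb


def break_frequency(n, k, b, d):
    """Frequency B_d of just d breaks among n joints, when a break
    determines the k-1 succeeding joints (serial order of determination).

    n: number of joints; k: joints covered by interference, including the
    joint of the break; b: frequency of a break at a free joint."""
    a = 1.0 - b
    if d == 0:
        return a ** n
    total = 0.0
    # First k-1 terms: terminal block incomplete, the final break being
    # followed by j determined a's (j = 0, 1, ..., k-2).
    for j in range(k - 1):
        free = n - (d - 1) * k - j - 1  # independently determined a's
        if free < 0:
            break  # this and all succeeding terms are omitted
        total += comb(free + d - 1, d - 1) * a ** free
    # Last term: all d k-blocks complete.
    free = n - d * k
    if free >= 0:
        total += comb(free + d, d) * a ** free
    return b ** d * total
```

With k=1 the loop contributes nothing and the function returns the ordinary binomial term; for any n and k the values for d = 0, 1, 2, ... sum to 1.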

It may be observed that this formula (26) is an entirely general formula
for the successive terms when in a series of things the occurrence of a
certain chance result for one unit determines also its results for a number
of the contiguous succeeding units, the order of determination being
serial. If we let k=1, so that no other units are determined, we have the
case of no interference, and formula (26) yields the usual binomial
expression for the expansion of $(a+b)^n$.

For $B_0$ and $B_1$ (the proportion of cases having no break, and 1
break, respectively), formula (26) simplifies directly into the following
expressions:

$$B_0 = a^n \qquad (27)$$

$$B_1 = a^{n-k}\,b\,(1+a+a^2+\cdots+a^{k-1}) + (n-k)\,a^{n-k}\,b \qquad (28)$$

Since b=1-a, the first member of this expression for $B_1$ is equal to
$a^{n-k}(1-a^k)$ (see text-books of algebra, e.g., HAWKES's Advanced
Algebra (1905), page 22). Substituting this we have for $B_1$ the more
convenient expression:

$$B_1 = a^{n-k}(1-a^k) + (n-k)\,a^{n-k}\,b \qquad (29)$$
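
The passage from (28) to (29) rests only on the sum of the geometric series, and may be confirmed numerically. The sketch below is illustrative; the function names are hypothetical, and n is assumed not less than k.

```python
def b1_expanded(n, k, b):
    """B_1 as in formula (28), with the series 1 + a + ... + a^(k-1)
    summed term by term."""
    a = 1.0 - b
    series = sum(a ** i for i in range(k))
    return a ** (n - k) * b * series + (n - k) * a ** (n - k) * b


def b1_closed(n, k, b):
    """B_1 as in formula (29), using b(1 + a + ... + a^(k-1)) = 1 - a^k."""
    a = 1.0 - b
    return a ** (n - k) * (1.0 - a ** k) + (n - k) * a ** (n - k) * b
```

The two functions agree to within rounding error for any admissible n, k and b.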


Formulae (26) to (29) give the proportions for the different possible
numbers of breaks between two genes, in terms of n, k and b. If these
three values are known, the crossover ratio C can be discovered by their
aid, since it will be the sum of all the cases in which there are odd
numbers of breaks between the two genes separated by n joints; that is, it
will be the sum of $B_1$, $B_3$, $B_5$, etc., up to $B_{n/k}$ (plus 1 for
any remainder above an integral quotient).
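
The summation of the odd classes may also be carried out without the explicit formula, by following the serial determination joint by joint. The sketch below uses this device (a dynamic-programming tally, which is a substitute for, not a transcription of, formula (26)); the function names are hypothetical.

```python
def break_count_distribution(n, k, b):
    """Return a list p in which p[d] is the frequency of just d breaks
    among n joints. Joints are determined serially; a break (frequency b)
    fixes the next k-1 joints as no-break."""
    a = 1.0 - b
    max_breaks = (n + k - 1) // k  # n/k, plus 1 for any remainder
    # state[pos][d]: frequency with which joint pos is the next free joint
    # and d breaks have occurred so far; positions past the end count as n.
    state = [[0.0] * (max_breaks + 1) for _ in range(n + 1)]
    state[0][0] = 1.0
    for pos in range(n):
        for d in range(max_breaks + 1):
            p = state[pos][d]
            if p == 0.0:
                continue
            state[pos + 1][d] += p * a               # no break at this joint
            state[min(pos + k, n)][d + 1] += p * b   # break; skip the block
    return state[n]


def crossover_ratio(n, k, b):
    """Crossover ratio C: the sum of the classes with odd numbers of breaks."""
    return sum(break_count_distribution(n, k, b)[1::2])
```

With k=1 the result reduces to the familiar expression for no interference, one-half of $1-(1-2b)^n$.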

But for the conditions in inheritance we know neither n, k nor b, so that
it is desirable to obtain formulae for $B_0$, $B_1$, $B_2$, etc., and for the
crossover ratio C, that shall be independent of the precise values of n, k
and b, depending merely upon certain general limits to be assigned them, as
we did for the case of no interference.

In this case as in that of no interference, we know that the frequency 
of a break at a single joint is less than .005. 

With regard to k, we know, not the actual number of joints over which 
interference extends, but the extent of the chromosome, measured in 
percentage of crossing over, over which it extends. According to BRIDGES 
(1921), in chromosome I of Drosophila, interference is practically complete 
over a distance which itself yields about 15 percent of crossing over; in 
chromosome II near the ends this distance gives about 20 percent of 
crossing over, while in the middle region it is about 10 percent. 

For our purposes, consider the case that two genes are separated by
just k joints. In this case there can be but 1 break between them. The
frequency of no break whatever between them will of course be $a^k$ (since
interference have arisen (this frequency we have called b), and the actual
average frequency of a break at any joint, in view of the fact that after
one break has occurred interference does arise. This actual average
frequency of a break at any joint, in case of interference, we may designate
β. With constant b, the value of β decreases as interference increases.
This may be obtained for the distance k, over which interference extends,
by dividing the value P (=C for k joints) by k. The value of β for
different extents of interference when b=.002, is given in the third row of
table 9.
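
The division of P by k can be illustrated as follows. This is a sketch under two assumptions not stated in so many words above: that P, the extent of interference expressed as a crossover ratio, equals $1-a^k$ (no break among k joints having the frequency $a^k$, and only one break being possible), and that k may be treated as a continuous quantity. The function names are hypothetical.

```python
from math import log


def interference_joints(extent, b):
    """Number of joints k over which interference extends, found from
    extent = 1 - a**k with a = 1 - b (k treated as continuous)."""
    return log(1.0 - extent) / log(1.0 - b)


def average_break_frequency(extent, b):
    """Actual average frequency of a break at any joint: P divided by k."""
    if extent == 0:
        return b  # no interference: the average frequency is b itself
    return extent / interference_joints(extent, b)
```

For b=.002 and an extent of 20 percent this gives about .00179, and for 90 percent about .00078.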


TABLE 9

Number of joints in a single unit giving 1 percent crossing over, for certain
different values of interference, in case b, the frequency of a break at any
joint before the conditions for interference arise, is .002.

Extent of interference, percent                  0      20      25      30      35      40      50      75      90
Number of joints for 1 unit                      5.04   5.57    5.75    5.94    6.15    6.38    6.94    9.23    12.78
Average frequency of a break at any joint (=β)   .002   .00179  .00174  .00168  .00163  .00157  .00144  .00108  .00078



4. If interference extends for not more than 30 units, the crossover 
ratio satisfies the first of our basic propositions (page 397),—namely, that 
the ratio rises to about 50 percent, but does not go appreciably above 50 
percent. 

5. But if interference extends for a distance greater than 30 units, 
then as the distance between the genes becomes greater, the ratio 
rises above 50 percent. At interference of 30 units it rises theoretically to 
a maximum of 50.6 percent (at a length of 120 units); for still higher values 
of interference it rises still higher, as shown in table 10. If interference is 
sufficiently extensive it may rise to any value less than 100 percent. 

It is to be kept in mind, of course, that for the different grades of inter- 
ference the units are of diverse lengths (table 9), so that the third column 
of table 10 does not give the relative actual lengths at which the maxi- 
mum ratio is reached. These may be obtained by multiplying the values 
in the third column of table 10 by the corresponding lengths of the 
units in the second row of table 9. If we adopt the lowest of these values 
as a unit, the relative lengths are shown in the fourth column of table 
10. 
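
The multiplication just described is a matter of simple arithmetic, sketched below in Python. The figures are those of tables 9 and 10; the variable and function names are hypothetical.

```python
# Joints in one unit (second row of table 9), keyed by extent of interference.
JOINTS_PER_UNIT = {25: 5.75, 30: 5.94, 35: 6.15, 40: 6.38,
                   50: 6.94, 75: 9.23, 90: 12.78}

# Distance in units at which the maximum is attained (third column of table 10).
DISTANCE_AT_MAXIMUM = {25: 175, 30: 120, 35: 95, 40: 90,
                       50: 85, 75: 90, 90: 90}


def relative_lengths():
    """Actual lengths (in joints) at which the maximum ratio is reached,
    expressed relative to the lowest of them."""
    joints = {extent: DISTANCE_AT_MAXIMUM[extent] * JOINTS_PER_UNIT[extent]
              for extent in DISTANCE_AT_MAXIMUM}
    shortest = min(joints.values())
    return {extent: round(length / shortest, 2)
            for extent, length in joints.items()}
```

The lowest value falls at 40 units of interference, which therefore serves as the unit of comparison.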
TABLE 10

Limiting values of the crossover ratio, with different extents of
interference (measured in units each of which gives 1 percent crossing
over); with the number of units distance at which the ratio is at the
maximum.

EXTENT OF       LIMITING         APPROXIMATE DISTANCE     APPROXIMATE RELATIVE
INTERFERENCE,   CROSSOVER RATIO  AT WHICH THE MAXIMUM     ACTUAL LENGTHS AT WHICH
UNITS           IN PERCENT       VALUE IS ATTAINED,       MAXIMUM IS ATTAINED
                                 IN UNITS
 0              50
20              50               ....                     ....
25              50               175                      1.75
30              50.6             120                      1.24
35              52.4              95                      1.02
40              54.9              90                      1.00
50              60.6              85                      1.03
75              77.9              90                      1.45
90              90.0              90                      2.00

6. When as a result of interference the maximum crossover ratio rises
above 50 percent, then as the distance between the genes is still farther
increased, the ratio undergoes oscillatory changes illustrated in the
curves of figure 1. Beyond the maximum the ratio begins to decrease,
falling to 50 percent, then below 50 percent. The amount by which it
thus goes below 50 percent is somewhat less than the amount by which, at
its highest, it exceeded 50 percent. As the distance between the genes is
still farther increased it turns again and climbs once more above 50
percent, but does not again reach the maximum previously attained. It
then falls again, and continues to oscillate about 50 percent, the
divagations above and below that value becoming less and less, until it
finally settles down at approximately 50 percent, the oscillations becoming
imperceptible. As the curves of figure 1 show, the dips below 50 percent are,
in the case of high interference, often less in amount than the later rise
above 50 percent, so that the oscillations above and below the line of 50
percent are not symmetrical.

These oscillations in the value of the crossover ratio are due to the fact 
that odd numbers of breaks give crossing over, while even ones do not. 
At the maximum value of the ratio the prevailing number of breaks is 1; 
this then decreases, while the proportion with 2 breaks increases, thus 
decreasing the ratio; and so on. For example, when the interference is 
75 percent, the maximum value of the ratio (77.9 percent), is likewise 
the percentage with a single break, while now only 3.2 percent have two 
breaks. At the minimum value (38.8 percent), there are but 29.4 percent
with 1 break and 9.4 percent with 3 breaks, while 58.8 percent have two 
breaks (the remainder, 2.4 percent having no breaks). 
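
The figures for 75 percent interference at the maximum can be approximately recovered from formulae (27) and (29). The sketch below is illustrative and rests on assumptions: b is taken as .002, the number of joints k is found from $1-a^k=.75$ and rounded, the distance of 90 units is converted at 9.23 joints to the unit and rounded, and the class of two breaks is obtained as the remainder (three breaks being impossible while n does not exceed 2k). The function name is hypothetical.

```python
from math import log


def break_classes_at_maximum(extent=0.75, b=0.002,
                             distance_units=90, joints_per_unit=9.23):
    """Return the frequencies of no break, one break and two breaks."""
    a = 1.0 - b
    k = round(log(1.0 - extent) / log(a))        # joints under interference
    n = round(distance_units * joints_per_unit)  # joints between the genes
    if not (k <= n <= 2 * k):
        raise ValueError("sketch valid only while at most two breaks can occur")
    no_break = a ** n                                              # formula (27)
    one_break = a ** (n - k) * (1.0 - a ** k) + (n - k) * a ** (n - k) * b  # (29)
    two_breaks = 1.0 - no_break - one_break
    return no_break, one_break, two_breaks
```

Under these assumptions the single-break class comes out close to 78 percent and the two-break class close to 3 percent, in agreement with the values quoted above to within the rounding of k and n.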

Figure 1 shows sketches of comparative curves of the crossover ratios 
for different amounts of interference, from 0 to 90 percent. The lengths 
are measured in the standard units of length yielding 1 percent crossing 
over. It is to be remembered of course that these units are of different 
lengths for different degrees of interference, as shown in table 9.

Examination of the curves of figure 1 shows that for different degrees of
interference the curves diverge more and more widely until the genes are
about ninety units apart; here the divergence between the different curves
is at a maximum. Then the curves approach, and for genes about 150 units
apart the crossover ratios are nearly the same (in the neighborhood of
50 percent) for all degrees of interference. For still greater distances
the divergences increase and decrease in an oscillating manner.

7. It is obvious that for high extents of interference, in which the 
crossover ratio goes above 50 percent, the serial order of the genes would 
not be given by the increase of the crossover ratio measured from the 
end gene. Genes farther from the end gene would often have a lower 
ratio than those nearer. Genes near the end gene would give with it a 
low ratio, this gradually increasing with the distance, to above 50 percent, 
then decreasing with greater distance, again increasing, and so on. A 
chromosome map constructed, like those in use in Drosophila, on the 
basis that increased crossover ratios meant greater distances would give 
incorrect and inconsistent results, the genes seeming to be in different 
orders when different basic genes were used from which to measure the 
ratios of the rest. 

Even in such cases, however, a map giving the genes in correct order 
could be obtained by considering successively genes having low crossover 
ratios (say 10 percent or less). The order of 3 genes whose ratios all fall 
within 10 to 20 percent would be first determined, then the order of two 
of these with a fourth that was near; by a concatenation of such successive 
determinations, in the manner now practically employed by the students 
of Drosophila, the correct order would finally be obtained for the entire 
chromosome. 

None of these difficulties or inconsistencies are normally met in the case
of Drosophila, so far as present knowledge goes; since interference does
not extend so far as to give crossover ratios above 50 percent. A case
giving results akin to these was however described briefly by MULLER at
the meeting of the American Society of Zoologists in December, 1921.
In a particular instance a gene was present which interfered with a
break in the chromosome for a longer distance than usual; for genes more
than twice that distance apart two breaks were more frequent than 1, so
that the crossover ratio became less for genes farther apart than for those
nearer together.

At the same meeting IBSEN (1922) described in the guinea-pig certain 
genetic results that could be explained by assuming that 75 percent 
crossing over had occurred in parents of both sexes. This possibility had 
not been subjected to a critical test. If substantiated, it would follow that 
in the guinea-pig the relative frequency of crossing over between genes 
would not indicate the serial order of the genes in the chromosome: this 
serial order would be determined only by dealing successively with genes 
having low crossover ratios. 

8. In comparing the computed relations with those found in experi- 
mental breeding, it is to be noted that our curves and computations are 
for certain definite average extents of interference. The general results 
would be similar whether this average represented a single invariable 
distance over which interference extends, so that no second break ever 
occurred within that distance, while beyond that distance there was no 
interference; or whether it is an average of the different distances over 
which interference extends in different cases. In Drosophila it is clear 
from the breeding data that the distance over which interference acts 
is not invariable; at certain distances from a given break another break is 
merely less frequent than would be the case if the given break did not 
exist. The distance over which interference extends is not given by the
students of Drosophila as an average, but simply as the distance over 
which interference is practically complete. It appears clear that the aver- 
age distance over which interference extends would be somewhat greater 
than this distance over which interference is practically complete. 

According to BRIDGES (1921, p. 127), this distance over which
interference is practically complete is (from the results of experimental
breeding) in the first or X chromosome of Drosophila about 15 units or 15
percent; for the second and third chromosomes of Drosophila it is about
20 percent in the end regions, 10 percent in the middle regions. These
would correspond to averages somewhat higher, perhaps from 20 to 25
percent. In none of these cases therefore should there occur crossover
ratios above 50 percent, and according to BRIDGES (1921, p. 128) no ratios
above 50 percent have yet been met in Drosophila. The serial order of
the genes would be indicated in all such cases by the increasing crossover
ratios with the end gene.
9. On page 422 it was shown that there are four serious discrepancies
between the crossover relations found in nature and those which must 
occur if crossing over occurs as the linear theory holds, but without inter- 
ference. One of these discrepancies consisted of the very facts for explain- 
ing which the hypothesis of interference was introduced. The three others 
were numerical discrepancies, not held in view (nor even known) when the 
hypothesis of interference was proposed. It remains to determine whether 
these discrepancies would be removed if interference occurs. We will deal 
with the three in succession. 

10. The first numerical discrepancy was, that the observed values of 
the ratios are inconsistent with each other; if crossing over occurs without 
interference, the observed ratios for distant genes are regularly greater 
than they should be as compared with those for genes close together. 
That is, if we compute from the shorter stretches of the chromosome the 
necessary crossover ratios for the longer stretches, made up of the shorter 
ones, we find that these ratios computed on the basis of no interference 
are always less than those actually observed (see page 419). 

Examining the results of interference, we find that this does increase
the crossover ratios for distant genes, as compared with those between
genes close together, thus causing the required conditions to approach
those actually found. For example, if we take the crossover ratio between
genes 10 units apart as that for genes "close together," and employ this
as unity, then table 11 shows what number of times this ratio is yielded by
genes separated by greater distances, in the case of certain degrees of
interference.

TABLE 11

INTERFERENCE,     DISTANCES BETWEEN THE GENES, IN UNITS
PERCENT           10      20      30      60      100     160
 0                1       1.83    2.48    3.84    4.74    5.25
20                1       2.00    2.79    4.15    4.77    4.97
25                1       2.00    2.90    4.37    4.91    4.99
30                1       2.00    3.00    4.60    5.04    5.03

It will be observed in this table that for distances up to 100 units 
(where the crossover ratio reaches 47 to 49 percent) the more the inter- 
ference, the greater is the proportion of the longer stretches as compared 
to the shorter ones. At very long distances, however, the two become 
equalized, and finally the proportion is greater for no interference. Most 
of the crossover ratios usually dealt with fall in the intermediate region. 
To determine how closely the theoretically required ratios approach
those found in nature, we may compute from the ratios observed for short
distances those that are required for longer distances, on the basis of
different degrees of interference. For the short distances, we will employ
the well-established ratios, each based on more than 5000 cases, that are
set forth on page 419. In table 12 are given for the longer stretches:
(a) the ratios required if there is no interference; (b) those required by 20
percent interference; (c) those required by 25 percent interference; (d)
the observed ratios.

TABLE 12

Comparison of the crossover ratios for the long stretches between the genes
named, as computed from the component shorter stretches given on page 419,
for different extents of interference (a, b and c), with those observed (d).

                        (a)            (b)            (c)            (d)
                        NO             INTERFERENCE   INTERFERENCE   OBSERVED
                        INTERFERENCE   20 PERCENT     25 PERCENT     RATIOS
Chromosome I
  Yellow-miniature      31.6           32.8           33.3           34.3
  White-rudimentary     38.2           40.0           40.5           42.4
  White-bar             38.7           40.9           41.6           43.6
  Vermilion-bar         21.1           23.4           23.9           23.9
  Vermilion-fused       22.5           25.3           26.2           25.8
  Vermilion-forked      20.0           23.0           23.5           23.5
Chromosome II
  [name illegible]      47.4           47.9           49.0           48.3
  Star-curved           43.4           44.1           45.4           45.9
  Star-purple           39.4           39.8           40.6           43.7
  Dachs-vestigial       29.3           31.4           32.9           29.6
  Black-morula          43.8           44.3           44.8           46.6
  Purple-speck          38.1           39.4           40.9           45.7
Chromosome III
  Sepia-rough           35.5           39.1           41.2           39.1
  Dichaete-rough        32.0           34.3           36.0           36.1
  [name illegible]      23.7           29.2           30.5           26.2


Table 12 is made by the aid of extensive and detailed tables computed 
in the course of this investigation, showing the crossover ratios corre- 
sponding to many different numbers of units of length, for nine different 
extents of interference, ranging from 0 to 90 percent. 
As table 12 shows, in all cases, the observed values for the ratios are
greater than those permissible in case there were no interference. No
exact coincidence of the required with the observed values is, of course, to
be expected, owing to the numerous sources of irregularity and inaccuracy
in the observed ratios; but the systematic diversity in one uniform
direction is significant. With 20 percent interference the required ratios
approach more nearly the observed ones, but still remain below them (in
all except two of the fifteen cases). With 25 percent interference the
required values approach still more closely the observed ones; now in nine
of the fifteen cases the required ratios are either practically identical with
the observed ones (vermilion-bar, star-curved, dichaete-rough) or are a
little above them. It is clear that with the addition of interference of
about 20 or 25 percent, our first numerical discrepancy disappears; the
observed ratios for genes close together become consistent with those for
genes far apart.
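
The counts just given may be verified from the figures of table 12. The sketch below is illustrative only; the names are hypothetical, the two gene pairs whose names are illegible are given placeholder labels, and "practically identical" is taken, as an assumption, to mean a required ratio not more than 0.6 below the observed one.

```python
# (pair, no interference, 20 percent, 25 percent, observed), from table 12.
TABLE_12 = [
    ("yellow-miniature", 31.6, 32.8, 33.3, 34.3),
    ("white-rudimentary", 38.2, 40.0, 40.5, 42.4),
    ("white-bar", 38.7, 40.9, 41.6, 43.6),
    ("vermilion-bar", 21.1, 23.4, 23.9, 23.9),
    ("vermilion-fused", 22.5, 25.3, 26.2, 25.8),
    ("vermilion-forked", 20.0, 23.0, 23.5, 23.5),
    ("chromosome II, name illegible", 47.4, 47.9, 49.0, 48.3),
    ("star-curved", 43.4, 44.1, 45.4, 45.9),
    ("star-purple", 39.4, 39.8, 40.6, 43.7),
    ("dachs-vestigial", 29.3, 31.4, 32.9, 29.6),
    ("black-morula", 43.8, 44.3, 44.8, 46.6),
    ("purple-speck", 38.1, 39.4, 40.9, 45.7),
    ("sepia-rough", 35.5, 39.1, 41.2, 39.1),
    ("dichaete-rough", 32.0, 34.3, 36.0, 36.1),
    ("chromosome III, name illegible", 23.7, 29.2, 30.5, 26.2),
]

TOLERANCE = 0.6  # assumed meaning of "practically identical"


def summarize(rows=TABLE_12):
    """Count the cases described in the text."""
    below_without = sum(1 for _, a, _, _, obs in rows if a < obs)
    above_with_20 = sum(1 for _, _, b, _, obs in rows if b > obs)
    equal_or_above_25 = sum(1 for _, _, _, c, obs in rows
                            if c >= obs - TOLERANCE)
    return below_without, above_with_20, equal_or_above_25
```

Here `summarize()` returns the number of pairs for which the required ratio without interference falls below the observed one, the number for which the 20 percent value exceeds it, and the number for which the 25 percent value equals or exceeds it within the assumed tolerance.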

11. The second numerical discrepancy between the observed facts 
and the requirements of the linear theory without interference was that 
the proportions of the chromosomes showing the different numbers of 
breaks would not be those actually found. The latter show a marked ex- 
cess in the proportion having but one break, with a corresponding defi- 
ciency of those having two or more breaks (page 421). Interference, we 
have seen (p. 424), tends to produce just this effect; the question here is: 
does it produce this effect to just the extent demanded in order to bring the 
observations and the requirements into unison? 

In table 13 are given the proportions for the different numbers of breaks 
observed in experiments directed upon this point, by MULLER (1916), 
PLOUGH (1917), WEINSTEIN (1918) and GOWEN (1919), together with the
proportions required on the linear theory (a) without interference, (b) 
with 20 percent interference, and (c) with 25 percent interference. In all 
cases the observed proportion showing no breaks is considered the fixed 
point, since this proportion is not altered by interference. From this are 
computed the required proportions for 1, 2, 3 or 4 breaks. For no inter- 
ference this computation is made by the formula (20 a), page 421; for the 
different degrees of interference the computation was made by the use of 
formulae 30, 31, 33 and 34, in connection with extensive tables made for 
producing the curves of figure 1. 

TABLE 13

Comparison of the proportions showing 0, 1, 2, 3 or 4 breaks in chromosomes
of Drosophila, with the proportions required by the linear theory, (a) when
there is no interference; (b) when there is 20 percent interference; (c) when
there is 25 percent interference. The data are all for Drosophila
melanogaster, except the case last given, in which the data are for
Drosophila virilis.

NUMBER OF BREAKS                   0            1            2            3            4
(percentages)

MULLER 1916. I. Yellow to bar. 712 cases.
  Observed                         54.2         41.6         4.3          0            0
  Required, interference 0                      33.1         10.2         2.1          0.3
  Required, interference 20%                    40.0         5.8          0.1          0
  Required, interference 25%                    41.6         4.3          0            0

MULLER 1916. II. Star to balloon. 460 cases.
  Observed                         32.6         51.1         15.0         1.3          0
  Required, interference 0                      36.6         20.5         7.7          2.2
  Required, interference 20%                    44.7         19.5         3.0          0.1
  Required, interference 25%                    47.2         18.6         1.9          0.02

PLOUGH 1917. I. Vermilion to forked. 5134 cases.
  Observed                         77.3         22.5         0.2          0            0
  Required, interference 0                      19.9         2.4          0.2          0
  Required, interference 20%                    22.6         0.05         0            0
  Required, interference 25%                    22.7         0            0            0

PLOUGH 1917. II. Black to curved. 17,225 cases.
  Observed                         [illegible]  20.5         1.3          0            0
  Required, interference 0                      21.4         3.0          0.3          0
  Required, interference 20%                    24.2         0.6          0            0
  Required, interference 25%                    24.8         0            0            0

WEINSTEIN 1918 (data from BRIDGES). I. Vermilion to forked. 3394 cases.
  Observed                         78.1         21.8         0.1          0            0
  Required, interference 0                      19.3         2.4          0.2          0
  Required, interference 20%                    21.7         0.1          0            0
  Required, interference 25%                    21.9         0            0            0

WEINSTEIN 1918. I. Eosin to cleft. 2572 cases.
  Observed                         47.3         45.9         6.6          0.2          0
  Required, interference 0                      35.4         13.2         3.3          0.6
  Required, interference 20%                    42.6         9.7          0.4          0
  Required, interference 25%                    44.8         7.9          0.1          0

GOWEN 1919. III. Sepia to rough. 31,456 cases.
  Observed                         54.6         38.9         6.3          0.3          0.003
  Required, interference 0                      33.1         10.0         2.0          0.3
  Required, interference 20%                    [illegible]  [illegible]  [illegible]  0
  Required, interference 25%                    41.3         4.2          0            0

GOWEN 1919. III. Dichaete to rough. 2381 cases.
  Observed                         51.2         43.4         [illegible]  0.3          0
  Required, interference 0                      [illegible]  [illegible]  2.6          0.4
  Required, interference 20%                    41.4         7            0.2          0
  Required, interference 25%                    43.1         5            0.01         0

WEINSTEIN (unpublished). I. Drosophila virilis. 4540 cases.
  Observed                         32.6         43.8         20.2         3.2          0.2
  Required, interference 0                      [illegible]  [illegible]  [illegible]  [illegible]
  Required, interference 20%                    44.7         19.5         3.0          0.1
  Required, interference 25%                    47.2         18.6         1.9          0

Table 13 shows that with no interference the proportion with one
break is uniformly much too low, while the proportions with 2, 3, 4 breaks
are much too high, as compared with the observed proportions. As
interference is made greater the proportion with one break gradually
increases, while the proportions with more than one correspondingly
decrease. At 20 or 25 percent interference the proportions for the
different numbers of breaks become extremely close to those observed.

Thus, if there is interference our second numerical discrepancy, like 
our first one, disappears; and the degree of interference required to make 
them disappear is the same for both. 
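
The rows of table 13 for no interference can be approximately reproduced from the observed proportion with no breaks alone. Formula (20 a) is not reproduced in this section; the sketch below therefore substitutes, as an assumption, a Poisson law whose mean is fixed by the zero class. It is a stand-in, and small differences from the tabulated values are to be expected. The function name is hypothetical.

```python
from math import factorial, log


def poisson_break_proportions(no_break, max_breaks=4):
    """Proportions with 1, 2, ..., max_breaks breaks, taking the observed
    proportion with no breaks as the fixed point (assumed Poisson law)."""
    mean = -log(no_break)  # since the zero class of a Poisson law is exp(-mean)
    return [no_break * mean ** d / factorial(d)
            for d in range(1, max_breaks + 1)]
```

For the first case of the table (54.2 percent with no breaks) this gives values within about 0.1 percent of the tabulated 33.1, 10.2, 2.1 and 0.3.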

12. The third numerical discrepancy between the results of the linear
theory without interference and the observed relations was that the
"map distances" determined for the chromosomes are much too small
(see table 6, p. 422). What effect will interference have on these relations?

These map distances are measured in units, each of which is the length
required to give 1 percent crossing over. Table 14 shows the map distances
given by BRIDGES (1921) for the best-known genes of chromosomes I and
II of Drosophila melanogaster, in comparison with those required: (a) if
there is no interference; (b) if there is 20 percent interference; (c) if there
is 25 percent interference. These distances are computed from the most
reliable data as to the crossover ratios for the two end genes. For
chromosome I, the gene white is taken as the terminus, since its ratios with
the others are much better determined than are the ratios with yellow,
commonly taken as the terminus of reference. The ratios employed are:
for I, those given by MORGAN and BRIDGES (1916); for II, those given by
BRIDGES and MORGAN (1919). The numbers of units required for different
ratios, in the cases of 20 and 25 percent interference, were obtained
in connection with the extensive tables computed for making the curves
of figure 1.

TABLE 14

Comparison of the map distances (in units of 1 percent crossing over), given
by BRIDGES (1921), with those required by the linear theory: (a) without
interference; (b) with 20 percent interference; (c) with 25 percent
interference.

CHROMOSOME I. DISTANCES FROM GENE WHITE

                 Given by        (a)             (b)             (c)
GENE             BRIDGES         Interference    Interference    Interference
                 (1921)          0               20 percent      25 percent
White            0.0             0.0             0.0             0.0
Bifid            5.8             5.5             5.3             5.3
Club             15.2            16.7            14.3            14.3
Vermilion        [illegible]     46.6            34.0            32.0
Miniature        34.6            54.0            38.8            36.0
Sable            41.5            86.0            58.8            52.3
Rudimentary      53.0            93.2            63.7            55.8
Bar              55.5            101.8           68.8            59.8

CHROMOSOME II. DISTANCES FROM GENE STAR

                 Given by        (a)             (b)             (c)
GENE             BRIDGES         Interference    Interference    Interference
                 (1921)          0               20 percent      25 percent
Star             0.0             0.0             0.0             0.0
Streak           15.4            18.2            15.4            15.4
Dachs            29.1            42.5            31.4            29.7
Black            46.5            70.2            49.0            44.5
Purple           32.7            102.8           69.2            60.2
Curved           73.5            123.8           83.1            62.7
Speck            105.3           167.4           101.6           89.9

As table 14 shows, while the distances required by the linear theory in
case there were no interference are entirely incompatible with those set
forth by BRIDGES, with 20 to 25 percent interference the required
distances approach those given by BRIDGES. In chromosome I the distances
for 25 percent interference are very close to those of BRIDGES (save in
the case of the gene sable; the suspicion is raised that the crossover ratio
of this with white (41.2 percent according to MORGAN and BRIDGES 1916)
may have been inaccurately determined). In chromosome II, the better
agreement is given, on the whole, by 20 percent interference, although
here there are considerable irregularities. This agrees with the facts set
forth by BRIDGES (1921), namely, (a) that in the second chromosome
different parts show different extents of interference, and (b) except near
the ends, this chromosome shows less interference than chromosome I.

In view of the considerable irregularities to be expected from the many 
sources of error in determining the crossover ratios, and from the fact that 
different parts of a chromosome may have different degrees of inter- 
ference, it is clear that the effect of interference is to bring the distances 
required by the linear theory into essential agreement with those set forth 
by the workers on Drosophila. 
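
The closeness of agreement may be expressed by a single figure for each degree of interference. The sketch below computes, for chromosome I, the mean absolute difference between the distances of BRIDGES and those required; this measure is an illustrative choice, not one used in the original comparison, and the names are hypothetical. The gene white (zero in every column) and the gene vermilion (whose value from BRIDGES is illegible in table 14) are left out.

```python
# Chromosome I, distances from white, from table 14:
# (gene, Bridges 1921, interference 0, 20 percent, 25 percent).
CHROMOSOME_I = [
    ("bifid", 5.8, 5.5, 5.3, 5.3),
    ("club", 15.2, 16.7, 14.3, 14.3),
    ("miniature", 34.6, 54.0, 38.8, 36.0),
    ("sable", 41.5, 86.0, 58.8, 52.3),
    ("rudimentary", 53.0, 93.2, 63.7, 55.8),
    ("bar", 55.5, 101.8, 68.8, 59.8),
]


def mean_absolute_differences(rows=CHROMOSOME_I):
    """Mean absolute difference from the map of BRIDGES for the three
    columns of required distances: interference 0, 20 and 25 percent."""
    count = len(rows)
    result = {}
    for label, column in (("0", 2), ("20", 3), ("25", 4)):
        result[label] = sum(abs(row[column] - row[1]) for row in rows) / count
    return result
```

The differences diminish steadily from no interference to 25 percent interference, the gene sable contributing the largest share at every degree.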


CONCLUSION AND DISCUSSION 


Thus, if there is interference averaging about 20 to 25 percent, the 
theory that the genes are arranged in a linear series and that crossing over
takes place at conjugation of the chromosomes by breaks at certain points 
in the series, with interchange of sections of the two chromosomes, yields 
and requires the entire system of complex and peculiar relations found in 
nature (set forth in the 17 propositions of pages 397-399) ; and at the same 
time it requires and accounts for the specific numerical values,—the 
relative crossover ratios of different genes, the “map distances,” and the 
relative proportions of chromosomes showing 0, 1, 2, 3 or more breaks 
between given genes. 

No other theory of crossing over that has been proposed makes so much 
as a beginning in accounting for this system of relations. The theory of 
GOLDSCHMIDT (1917), as shown in a previous paper (JENNINGS 1918), re-
quires a definite system of relations between the crossover ratios, but it is
not this system; it is a system quite inconsistent with this one. The 
theory set forth by CASTLE (1919 a, b) likewise does not yield this sys- 
tem; it appears now to have been abandoned by its author. The redupli- 
cation theory supported by some English students gives not the slightest 
ground for the existence of any such system of relations. In view of the 
complexity and extraordinary character of the system, and in view of the 
precise numerical proportions in which it is embodied, it is almost im- 
possible to conceive that more than one set of conditions could produce it. 

It is to be noted that the conditions required for producing the ob- 
served system are simply: (1) the arrangement of the genes in linear 
series, with breaking and exchange at the time of pairing of the chromo- 
somes; and (2) interference extending over an average distance of 20 to 25 
units. It is not required that breaks should be as frequent between any 
two genes as between any other two, nor that equal absolute distances 
should in all cases give the same crossover ratio. The possible case 
that this is true was first investigated, as the simplest condition, and this 
yielded the fundamental formula (1). In this formula the units of length 
are the single joints between the genes, and the crossover ratio varies, in 
the way shown by the formula, with the distance apart of the genes in 
these units. But as soon as we transform our formulae so as to express 
the relations in the “standard” units, each of which is the distance neces- 
sary to give one percent of crossing over (formula (6) and those following 
it for no interference; formulae (30), (31), (33) and (34) for interference), 
they no longer require that breaks should be equally frequent at all joints, 
nor that equal absolute distances should give the same ratios. The only 
difference made if breaks are not equally frequent in all parts of the 
chromosome is that the absolute length of the “standard units” varies in
different regions: the formulae based on these units retain their validity, 
and the system of relations (propositions 1 to 17, page 397) remains un-
changed. BRIDGES (1915), STURTEVANT (1917, 1919), PLOUGH (1917) and
others have shown that various conditions, environmental or germinal, 
may alter the frequency of the breaks and exchanges in crossing over. 
The frequency may be reduced in one part of the chromosome, remaining 
unchanged in other parts (STURTEVANT 1919); then obviously the “stand-
ard units” are no longer of equal absolute length. GOWEN (1919) shows
that the frequency of breaks is very variable, and DETLEFSEN and
ROBERTS (1921) show that the variations are heritable and that by
selection the frequency in a given stock can be greatly changed. But all 
this does not affect the validity of our formulae that are based on the 
standard units; does not affect the general system of relations (proposi-
tions 1 to 17); these depend only on the serial arrangement of the genes.
If it could be shown that in a set of genes having (according to the linear 
theory) the order A-B-C, the crossover ratios between A and B and 
between B and C can be changed without correspondingly altering that 
between A and C, then indeed there would be difficulty for the linear 
theory, but no indication of this has been shown. 
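
The argument of this paragraph can be checked numerically. The following is only a sketch, not the paper's formula (1): it assumes that breaks at different joints are independent (no interference) and that a crossover between two genes is recorded whenever an odd number of breaks falls between them. The function names and the break frequencies are hypothetical, chosen for illustration.

```python
from math import prod


def crossover_ratio(break_probs):
    """Chance of an odd number of breaks among independent joints.

    break_probs holds the (assumed) chance of a break at each joint lying
    between the two genes; the chances need not be equal.
    """
    return (1.0 - prod(1.0 - 2.0 * p for p in break_probs)) / 2.0


def combine(r_ab, r_bc):
    """Ratio for A-C built from the ratios for A-B and B-C (no interference)."""
    return r_ab + r_bc - 2.0 * r_ab * r_bc


# Hypothetical, deliberately unequal break frequencies for the order A-B-C.
joints_ab = [0.002] * 10 + [0.03] * 10   # joints between A and B
joints_bc = [0.015] * 20                 # joints between B and C

r_ab = crossover_ratio(joints_ab)
r_bc = crossover_ratio(joints_bc)
r_ac = crossover_ratio(joints_ab + joints_bc)

print(round(r_ab, 4), round(r_bc, 4), round(r_ac, 4))
print(abs(r_ac - combine(r_ab, r_bc)) < 1e-12)
```

Under these assumptions the ratio for A-C is fixed by the ratios for A-B and B-C, whatever the frequencies at the individual joints; it exceeds either partial ratio and stays below 50 percent so long as every joint breaks in fewer than half the cases.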

DETLEFSEN (1920), in view of the change in the crossover ratios 
through selection, entitles a paper “Is crossing-over a function of dis-
tance?” and concludes that “In view of these considerations it would
perhaps be simpler to conclude that linkage is not a function of distance; 
i.e., crossing over is not necessarily proportional to distance” (p. 670). 
Obviously in view of the considerations above set forth, crossing over does 
not depend on distance alone and is therefore “not necessarily propor-
tional to distance”; many other things affect it. On this there is no dis-
pute; “actual length of the section between loci is only one of the factors
determining the amount of crossing over between the loci” (MORGAN 1922,
p. 188). The standard unit yielding one percent of crossing over need not 
be the same in different cases: “It has often been pointed out (e.g., 
STURTEVANT 1913, p. 49; MORGAN, STURTEVANT, MULLER and BRIDGES 
1915, pp. 67-68) that 1 percent of crossing over must not be supposed to 
represent the same actual morphological distance in different chromosomes 
or in different regions of the same chromosome” (STURTEVANT 1919, p. 
238). But distance remains nevertheless an important factor, in that, of 
two genes differing in distance, in the same direction, from a given gene A, 
the more distant one will have the higher ratio (provided of course that 
interference does not go beyond 30 units). 

If this relation of the ratios to distance be abandoned, the entire 
complex system of relations observed between the crossover ratios (page 
397) becomes unintelligible and should not exist. 

All the evidence is that the genes are in a linear series; the complex sys- 
tem of extraordinary relations shown by the crossover ratios is not 
explicable on any other ground. If the order of the genes is thus serial, 
this entire system results, whether the frequency of breaks between genes 
is or is not uniform in all parts of the chromosomes. 


SUMMARY 


1. In the only organism adequately studied (Drosophila), the diverse 
ratios of crossing over between different genes show among themselves a 
complex system of remarkable relations. These are illustrated in table 2, 
page 396, and are formulated in seventeen propositions, pages 397-399.
The reader should refer to this formulation as an essential part of the 
present summary. 

2. Any set of conditions which are assumed to account for the diverse 
ratios of crossing over must, if valid, yield and account for this system of 
relations, and for the numerical proportions in which it is expressed. 

The “linear theory,”—the theory that the genes are arranged in series
in the chromosomes and that crossing over occurs by breaking and 
exchange at certain points in the series, at the time of conjugation,—is 
examined as to whether it fulfils this test. 

3. Crossing over without interference: From the conditions in the 
linear theory, formulae are developed for the crossover ratio, first in case 
the occurrence of a break at one point does not affect the occurrence of 
breaks elsewhere (“no interference”). Formula (1), p. 405, gives the 
value of the ratio in terms of the number of genes or joints and of the 
frequency of a break at any joint; formula (8) the value for any number of 
“standard units” (each of such a length as to yield one percent of crossing 
over); formulae (10) and (11) the ratio produced by adding together 
lengths, for each of which when taken separately the ratio is known. For- 
mulae (16) and (20) give for any value of the crossover ratio the propor- 
tion of cases showing any particular number of breaks, as 0, 1, 2, 3, 4, etc. 

4. By the aid of these formulae, a series of crossover ratios as required 
by the linear theory without interference are computed and their relations 
developed (table 5, p. 418); these are compared with the conditions found 
in nature. These required ratios are found to show the entire system of 
complex relations found in nature and set forth in the 17 propositions of 
pp. 397-399, except the last one (that for which the hypothesis of inter- 
ference has been proposed). But besides this exception, three important 
numerical discrepancies are found to exist between the conditions re- 
quired by the linear theory without interference, and those observed in 
nature. These are: 

(a) The observed numerical values of the ratios could not occur: the 
observed ratios for distant genes are regularly greater than they could be 
on the linear theory without interference. 

(b) The proportions of the chromosomes showing the different numbers 
of breaks (0, 1, 2, 3, etc.) are not what are required; there is a marked 
excess in the proportion showing only one break, and a corresponding 
deficiency in the proportion showing 2, 3 or more breaks. 

(c) The “map distances” of the genes that have been determined for
the chromosomes of Drosophila by MORGAN and associates are much less
than are required by the linear theory without interference (table 6,
page 422). 

Thus the linear theory without interference, in spite of the fact that it 
yields all but one of the general relations found in nature, can not account 
for the observed ratios and their precise relations. 

5. The linear theory with interference: To account for the fact set forth 
in proposition 17, page 399, the hypothesis of interference has been 
proposed. This assumes that the occurrence of a break at a certain joint 
in the linear series of genes interferes in some way with the occurrence of 
another break at any joint within a certain distance. 

If valid, the existence of interference must preserve intact the validity 
of propositions 1 to 16, pp. 397-399; and it must not only account for propo- 
sition 17, for which it was devised; but it must also do away with the three 
discrepancies just mentioned, for which it was not devised. To determine 
whether it can meet this severe test, mathematical formulae are developed 
for crossing over with any extent of interference. First are developed 
formulae for the case, possible but apparently not realized, of complete 
interference,—a break at any point in the chromosome preventing any 
other break in that chromosome (formulae (21) to (25)). For interference
extending over only a part of the chromosome, for any given distance in 
units, the fundamental formula is (33), page 437; important special cases 
are covered in formulae (30), (31) and (34). 

6. By the aid of the formulae for the crossover ratios required by the 
linear theory, with interference, a series of ratios are computed (table 8, 
page 439, and figure 1); and their interrelations are investigated and com- 
pared with those found in nature. The following are found to be the facts: 

(a) Crossing over as required by the linear theory with interference 
extending over a distance of not more than about 30 percent (30 units) 
yields ratios not rising above 50 percent, and the greater the distance 
between two genes, the greater the crossover ratio between them. These 
are the same conditions occurring in crossing over without interference. 

(b) If interference extends for a distance greater than 30 percent, then 
as the distance between the two genes becomes greater, the crossover 
ratio rises above 50 percent. If interference is sufficiently extensive, it 
may rise to any value less than 100 percent (see table 10, page 442). 

As the distance between the genes is still farther increased, the crossover
ratio begins to decrease, falling again below 50 percent. It thus undergoes
oscillatory changes, rising and falling successively above and below 50
percent, but the oscillations becoming less and less marked, until at a
sufficiently great distance the ratio settles down to about 50 percent.
These oscillations are illustrated, for different extents of interference, in
the curves of figure 1, page 443. 

(c) Thus, for interference extending over long distances (above 30 per- 
cent), the serial order of genes would not be given by the increase of the 
crossover ratio measured from the end gene. Genes farther from the 
end gene would often have a lower crossover ratio with it than genes nearer 
to it. Thus, if the magnitude of the ratios were taken as giving the order
of the genes, they would appear to be in different orders, when the ratios 
were measured from different basic genes. 

(d) But crossing over as required by the linear theory, with interference 
extending on the average over a distance of not more than 30 units 
shows and requires all the general relations set forth in propositions 1 to 17, 
pages 397-399, as characteristic of the relations found in nature. 

(e) Interference extending on the average to a distance equivalent to 
about 20 to 25 percent crossing over requires and accounts for the condi- 
tions giving rise to the three numerical discrepancies (a), (b) and (c) of 
paragraph 4 of this summary. 

7. Thus, in sum, the linear theory of crossing over, with inter- 
ference of 20 to 25 percent, gives the entire system of complex and peculiar 
relations found in nature (set forth in the 17 propositions of pages 397-
399) and at the same time requires and accounts for the specific numerical
data,—the relative ratios of different genes, the “map distances,” and 
the relative proportions of chromosomes showing 0, 1, 2, 3 or 4 breaks. 

No other theory that has been suggested yields any such system. In 
view of the complexity and extraordinary character of this system, and 
in view of the precise numerical proportions it involves, it is difficult to 
conceive that any other set of conditions than that set forth in the “linear 
theory” could produce it. 
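
The behavior summarized in paragraph 6 can be explored with a small computation. This is a sketch under a simplified model of interference, not the paper's formulae (30) to (34): the chromosome is taken as a row of joints one standard unit apart, a break forbids any further break at the next `interference_units` joints, and elsewhere a break occurs with a fixed chance chosen so that adjacent genes show 1 percent of crossing over. The function name, its parameters and the model itself are assumptions; the distance at which ratios first pass 50 percent in this toy model need not coincide with the 30 units found in the paper.

```python
def crossover_ratio(distance_units, interference_units, density=0.01):
    """Chance of an odd number of breaks between genes distance_units apart.

    Assumed model: after a break, the next interference_units joints cannot
    break; a joint free of interference breaks with chance p, chosen so that
    the long-run frequency of breaks per joint equals density.
    """
    d = interference_units
    if density * d >= 1.0:
        raise ValueError("interference too long for this break density")
    p = density / (1.0 - density * d)

    # Stationary starting state: index c = joints still blocked by an
    # earlier break (0 means the next joint is free to break).
    free = 1.0 / (1.0 + p * d)
    even = [free] + [free * p] * d   # even number of breaks so far
    odd = [0.0] * (d + 1)            # odd number of breaks so far

    for _ in range(distance_units):
        new_even = [0.0] * (d + 1)
        new_odd = [0.0] * (d + 1)
        for c in range(1, d + 1):            # blocked joint: count down
            new_even[c - 1] += even[c]
            new_odd[c - 1] += odd[c]
        new_even[0] += even[0] * (1.0 - p)   # free joint, no break
        new_odd[0] += odd[0] * (1.0 - p)
        new_odd[d] += even[0] * p            # free joint, break: parity flips
        new_even[d] += odd[0] * p
        even, odd = new_even, new_odd
    return sum(odd)


for d in (0, 25, 60):
    row = [round(100 * crossover_ratio(n, d), 1) for n in (10, 50, 100, 200, 400)]
    print("interference", d, "units:", row)
```

With no interference the computed ratio rises steadily toward 50 percent without reaching it; with interference of 60 units in this model it passes well above 50 percent near a distance of 100 units and then settles back toward 50 percent at great distances.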
INTRODUCTION


Most horticultural varieties of fruit trees are propagated by grafting
buds or scions on seedling stocks. The seedling stocks are used either 
because the variety can not be perpetuated by cuttings or because it is 
more satisfactory or economical to graft than to develop roots from scions 
of the desired variety. This method of propagation results in a tree with 
a uniform top due to asexual methods of reproduction of the scion or bud, 
but with a variable stock, due to sexual methods of reproduction of the 
root system. 

The dual nature of grafted trees may be favorable or unfavorable de- 
pending on the relationship between stock and scion. It is well known that 
certain combinations of stocks and scions are more vigorous than others. 
The best combinations may also be determined by soil, climatic condi-
tions, or type of tree growth desired. Apples on pear roots and pear on
apple roots make short-lived trees, but the pear does very well on quince. 
Plums are worked on peach roots for light warm soils while plum seed- 
lings are used for stocks in colder regions. Certain varieties of prunes 
are not adapted to peach roots but may do well on almond or apricot. 
The use of certain stocks may also be determined by economic factors. 
For instance, Mazzard stocks for cherries usually give more produc-
tive and longer-lived trees than Mahaleb stocks, but the latter are largely
used by nurserymen because they are cheaper and easier to work. The 
problem of stocks has been ably presented by KAINS (1916).


1 Papers from the Biological Laboratory of the MAINE AGRICULTURAL EXPERIMENT STATION, 
No. 154. 
Most apple trees are grafted or budded on French Crab seedling stocks.
The French Crab apple seeds are from “natural” or seedling trees grown in
France, the fruit of which is used for making cider. The seedlings for 
stocks are grown either in France or in the United States. Many of the 
eastern nurserymen import the seedlings from France, but in the Middle 
West seedlings are frequently grown from the French Crab seed. Seeds 
from cider-mill refuse in New England have also been used for seedling 
stocks, but the use of cull fruit of horticultural varieties for making cider 
results in much seed which is not from seedling or “natural” trees. The
seedlings of horticultural varieties are considered inferior to “natural” 
seedlings for use as stocks. 

Since seedlings are used for stocks great variability would be expected 
because apple varieties are genetically impure and most apple trees must 
be cross-pollinated to set fruit. In the experimental seedling orchard of 
586 trees at the MAINE AGRICULTURAL EXPERIMENT STATION great 
variability exists not only in size but in morphological characters of 
individual trees, even when the seed came from a single horticultural 
variety. Whether seedlings are obtained from “natural” trees or horti-
cultural varieties they will be extremely variable in growth, adaptability,
hardiness, season of maturity and many other characters. This point has 
long been recognized by horticulturists. 

With the present methods of propagation it is often simply a matter 
of chance whether a nursery tree has a weak or vigorous root system, 
at least in the case of budded trees. In grafted trees where the scion is set 
deep it is claimed that in many cases enough roots come from the scion 
so that the seedling root is of only temporary value. This conclusion is 
open to some argument, however, and is not true for all varieties (see 
SHAW 1919). The effect of variable stocks on tree growth is not definitely
known, but it is probable that some of the great variability in nursery 
stock is caused by the variable seedling stocks used in grafting. It 
seems surprising that the question of grafting stocks has been so com- 
pletely neglected. However, it has probably received as much attention 
as some other important horticultural problems. For instance the first 
grade of one-year-old nursery stock advertised by a large company is 
quoted at $30.00 per hundred, while the 5th grade is quoted at $6.00 per 
hundred. Although there is presumptive evidence in favor of the best 
grade of trees, yet we have been unable to find any published results of 
experiments to test the growth and ultimate performance of different 
grades of nursery stock of the same age. Nor are the nurserymen them- 
selves agreed on this question. 
MATERIAL AND METHODS


In 1911 the horticulturist of the MAINE AGRICULTURAL EXPERIMENT 
STATION outlined an experiment to test the value of different varieties 
of apples on two kinds of stocks. The varieties used were Gravenstein, 
Rolfe, Milding, Wealthy, Stark, McIntosh, Baldwin, Spy, Larue and 
Tolman Sweet. The stocks selected were French Crab and Tolman 
Sweet. The Tolman Sweet was obtained by budding on French Crab and 
setting the whip deep enough to permit it to root. The French Crab root 
was not removed. In view of SHAW’s (1919) work it is improbable that
the Tolman ever went onto its own roots, as this variety was found to 
strike roots from the scion in very few cases. The different varieties were 
budded on the two classes of stocks in 1912 and 1913. The trees were set 
in the orchard in the spring of 1914. Forty trees of each variety were 
used,—twenty on French Crab and twenty on Tolman Sweet roots, and
were set in rows of 20 trees each. The trees on each kind of stock were 
planted in alternate rows. The problem as originally outlined is of little 
value due to the fact that all of the stocks can probably be properly classed 
as French Crab, although those designated as Tolman Sweet are double 
worked. However, the work is of value from another standpoint,—i.e.,
the permanence of early size differences in apple trees. 

The data collected on this work prior to 1914 are of questionable value 
in some cases and are not sufficiently complete for a thorough analysis 
of the problem. No data were taken on these trees in 1914 and 1915 but 
in 1916 the trunk girth was measured and since then the trunk circum- 
ference has been obtained each fall. Although about 400 trees were set 
in the orchard many of them died and were reset or were seriously injured 
so that in 1922 there were left 208 trees for which records are complete 
and which are free from injury. 

An attempt has been made to analyze the data available to determine 
the relation of early growth to growth in later years and to ultimate 
relative size. There are a number of variable factors, e.g., double-worked 
stocks, budding done in different years, and different varieties used, which 
could have been eliminated at the beginning. The effect of these factors, 
however, can be determined. The present analysis may also be of value 
in showing what factors should be eliminated in such an experiment,
and the method of analyzing such data. 


EXPERIMENTAL RESULTS 


There are five known variables which may affect the size of the apple 
trees in this stock-and-scion orchard. These are, (1) variable stocks due 
to double-working and deep-setting of roots classed as Tolman Sweet;
(2) variation in size due to unequal growth of different varieties of buds; 
(3) variation due to year budded,—whether in 1912 or 1913; (4) vari- 
ability due to soil heterogeneity; and (5) variability due to inherent 
differences in vigor and adaptability of the French Crab seedling stocks. 
The last variable is undoubtedly applicable to stocks classed as Tolman 
Sweet as well as those classed as French Crab. 

One possible variable, that of bud mutation, was eliminated in the 
beginning because all buds of each variety were selected from a single 
tree in each case. There is little evidence that this factor is of much 
importance even when buds are selected from trees of different degrees 
of productivity (SAx and GOWEN 1923). 


TABLE 1


The relation (correlation) of stock, scion, and year budded, with tree size, from 1916 to 1922, and
the correlation between 1916 growth and ultimate size of apple trees from 1917 to 1922 inclusive
for 208 trees. Also correlation between variety and size in different years and relation
between 1916 circumference and circumference in later years with and without cor-
rection for the effect of varietal differences on trunk girth.


                                                  TRUNK CIRCUMFERENCE
                                   1916     1917     1918     1919     1920     1921     1922

208 trees
  Stock                            .11      .03      .01      .03      .05      .01
  Variety                          .47±.04  .39±.04  .40±.04  .42±.04  .42±.04  .39±.04
  Year budded                     -.11     -.12     -.15     -.18     -.17     -.12
  1916 cir.                                 .90±.01  .79±.02  .74±.02  .71±.02  .68±.03  .66±.03

86 trees budded in 1913
  Variety                          .55±.04  .45±.05  .49±.05  .54±.05  .54±.05  .51±.05  .56±.05
  Scion uncorrected, 1916 cir.              .91±.01  .81±.03  .75±.03  .70±.04  .67±.04  .64±.04
  Scion corrected, 1916 cir.                .92±.01  .81±.03  .74±.03  .69±.04  .67±.04  .65±.04


The effect of certain of these variables is shown in table 1. The two 
classes of stocks, French Crab and Tolman Sweet, were correlated with 
trunk circumference in 1916 and subsequent years. The method used 
was developed by PEARSON (1909). It is evident from the small correla- 
tions obtained that there is no significant difference in size of trees on 
French Crab and the so-called Tolman roots. Although the mean size 
of trees on the two classes of stocks was not significantly different, those 
on French Crab roots were perhaps somewhat more variable. The dif- 
ference between the coefficients of variability for the two classes of stocks 
in 1920 was found to be 4.05±1.82. The increased variability of the
trees on French Crab roots may be of doubtful significance either statistic-
ally or biologically.

The scion or variety may cause some difference in size of trees because
some varieties make a more rapid growth than others. The correlation
ratios for scion with size in different years vary from .47±.04 in 1916 to
.39±.04 in 1917 and in 1921. The Stark trees made the greatest growth
while the Spy trees were the smallest.

The trees budded in 1912 were somewhat larger in 1916 and sub- 
sequent years than those budded in 1913. This difference is not large 
as indicated by the correlations of —.11 in 1916 to —.18 in 1919, and the 
year budded can be considered a minor factor in causing differences in 
size of these trees. 

The soil heterogeneity in the nursery could not be calculated due to 
lack of adequate data. In the orchard, however, the soil was found to 
be uniform when measured by the test for soil heterogeneity proposed by 
Harris (1913). The total number of trees in each row was used for the 
combination plots. The value of r was found to be .02 in 1916 and .01 in 
1921. 
FIGURE 1.—Showing the relation between 1916 circumference and size in subsequent years.
Also probable permanence in size from 1922 to 1928. (Axes: correlation plotted against year.)


There is a consistent difference in tree size in 1916 and subsequent 
years and these differences are relatively permanent as indicated by the 
comparatively high values of the correlations for trunk circumference. 
The correlation for tree size in 1916 with tree size in 1917 is .90±.01
while 1916 circumference with 1922 circumference gives a value of
r=.66±.03. The relation between tree size in subsequent years is shown
also in figure 1. Tree size in 1916 shows a correlation of .90 with 1917 
size and the correlation between 1916 size and subsequent years gradually 
declines to .66 in 1922. In projecting this curve we find that there will 
probably be little decrease in the relation of 1916 size to size in subsequent 
years after 1927 and that the value of r should never be less than .6 if 
the size relationships in the future may be predicted from past per-
formance.
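
The text does not say how the ± figures attached to the correlations were obtained. A minimal sketch, assuming they are probable errors computed by the formula customary at the time, .6745(1 - r²)/√n, reproduces the published figures; the function name is hypothetical.

```python
from math import sqrt


def probable_error(r, n):
    """Probable error of a correlation r from n individuals (assumed formula)."""
    return 0.6745 * (1.0 - r * r) / sqrt(n)


# Correlations of 1916 circumference with later circumference (table 1).
for r, n in ((0.90, 208), (0.66, 208), (0.64, 86)):
    print(r, n, round(probable_error(r, n), 2))
```

On this assumption the values come out at .01, .03 and .04, agreeing with .90±.01 and .66±.03 for the 208 trees and with .64±.04 for the 86 trees.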

We wish now to determine the effect of the various known factors in 
causing these permanent differences in tree size. It has been found that 
the two classes of stocks, French Crab and Tolman Sweet (so-called), 
have no appreciable effect on differences in trunk circumference in 1916 
and subsequent years. The year budded causes only very small dif- 
ferences of little or no importance as compared with the correlations for 
size from year to year. The soil was found to be homogeneous and can 
have little effect on variability of tree size. Although the different varie- 
ties of scions cause considerable differences in tree size the variability 
within each variety is so great that scion differences have little effect on the 
relation of tree size in 1916 and subsequent years. The effect of varietal 
differences on tree variability can be shown in two ways. It can be done 
by the method of partial correlation, assuming that the correlation ratio 
can be used as a correlation coefficient. The correlation between 1916 cir- 
cumference and 1917 circumference was found to be .88 when scion differ- 
ences are held constant, and for 1916 with 1921 the correlation was found to 
be .61. It is evident that the correlation between size of trees in different 
years is not significantly reduced by eliminating the effect of different varie- 
ties of trees. The effect of varietal differences can also be eliminated in 
another way. The correlation between scion or variety and trunk girth, and 
for the size in 1916 and subsequent years, is shown for 86 trees in table 1. 
These trees were selected for other purposes but are a random sample of 
the whole population. The effect of variety is somewhat greater than 
for the 208 trees, but the size relationships are about the same in 
the different years. The effects of varietal differences were eliminated 
by the use of a correction factor. This factor was obtained by dividing 
the mean size for each variety by the mean size of all 86 trees. In case a 
variety is smaller than the average of the total population the correction 
factor will be plus while if the reverse is the case the correction factor 
will be minus. The trunk circumference of each tree of each variety was 
multiplied by the correction factor for that variety for every year. Thus 
all varieties were placed on the same size basis so far as varietal differences 
are concerned. The trees thus corrected show no correlation between 
size and variety. The correlations of trunk circumference in 1916 with
subsequent years, using the data corrected for varietal differences, are 
practically the same as the correlations without the corrections for 
varieties of scions (table 1). It is clear that permanent differences in 
tree size are so large even in a given variety, that varietal variations have
little or no effect on these permanent differences. 
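
Both ways of removing the varietal effect can be sketched in a few lines. The partial correlation below uses the usual first-order formula with the table 1 values for the 208 trees. The second function illustrates the correction factor on hypothetical girths; it assumes the factor is the mean of all trees divided by the mean of the variety, since multiplying by that ratio is what brings every variety to the common mean. The function names and the sample girths are not from the paper.

```python
from math import sqrt
from statistics import mean


def partial_correlation(r12, r13, r23):
    """Correlation of variables 1 and 2 with variable 3 held constant."""
    return (r12 - r13 * r23) / sqrt((1.0 - r13 ** 2) * (1.0 - r23 ** 2))


# 1 = 1916 circumference, 2 = later circumference, 3 = variety (scion).
r_1917 = partial_correlation(0.90, 0.47, 0.39)
r_1921 = partial_correlation(0.68, 0.47, 0.39)
print(round(r_1917, 2), round(r_1921, 2))


def correct_for_variety(girths_by_variety):
    """Scale each variety so that its mean girth equals the overall mean."""
    overall = mean(g for girths in girths_by_variety.values() for g in girths)
    corrected = {}
    for variety, girths in girths_by_variety.items():
        factor = overall / mean(girths)   # above 1 for a small variety
        corrected[variety] = [g * factor for g in girths]
    return corrected


# Hypothetical trunk girths for two varieties.
sample = {"Stark": [30.0, 34.0, 38.0], "Spy": [20.0, 22.0, 27.0]}
corrected = correct_for_variety(sample)
print({v: round(mean(g), 2) for v, g in corrected.items()})
```

The partial correlations come to .88 and .61, the values given in the text, and after correction both hypothetical varieties have the same mean girth while the relative differences among trees within a variety are unchanged.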

It is evident that none of the following variables,—the two classes of 
stocks, scion differences, year budded, soil heterogeneity of the orchard, 
or bud mutation, causes any appreciable part of the permanent differences 
in tree size. There remains only one other known factor, the variability 
of the French Crab seedling stocks. There are undoubtedly a number of 
other possible factors which might cause some permanent differences 
in tree growth, but the above factors are the most probable known causes 
of tree variability. Whatever the factors may be which cause permanent 
differences in tree size it is evident that trees on seedling stocks vary 
greatly in growth and vigor even when most of the other known environ- 
mental factors which might cause variability are under control or are of 
minor importance. 


THE RELATION OF TRUNK CIRCUMFERENCE TO PRODUCTIVITY 


In the analysis of the data on the “stock and scion” orchard the trunk 
girth has been considered as a measure of the value of the tree. It is 
generally recognized by horticulturists that tree size is closely associated 
with productivity. It can also be shown that trunk girth is not only a good 
index of productivity but that the permanence of differences in yield is
also associated with tree size. In our Ben Davis orchard of 881 trees the 
average interannual correlation for yield from 1914 to 1918, inclusive, was 
found to be .57. The mean correlation for trunk circumference and yield
was .67. By means of a partial correlation coefficient it can be shown 
that the permanence in differences in yield is largely due to differences
in tree size as indicated by trunk girth. The average interannual correla- 
tion for yield, when trunk circumference is held constant, was found to be 
only .22 as compared with .57 when trunk circumference is not controlled. 
The percent of variability controlled by a given correlation, r, is equal to
$1-\sqrt{1-r^{2}}$. Thus 18 percent of the variability is controlled by a correla-
tion of .57 while only 2 percent is controlled by a correlation of .22. This
means that the permanence of tree yields from year to year is almost
entirely due to permanent differences in trunk circumference. Trunk
circumference is then a fairly reliable measure of a tree’s productivity 
and also of the permanence of such productivity. Permanent differences 
in tree productivity have been attributed to inherent differences in 
scions as a result of bud mutation, but in our stock and scion orchard the 
buds of each variety were selected from a single tree in each case, so that 
the permanent differences in size, and therefore in productivity, must be 
attributed to factors other than bud mutation. It is perhaps significant
that the degree of permanence of size differences in our apple orchard is 
about the same as the degree of permanence in productivity of citrus 
trees reported by SHAMEL and his colleagues (see SAx and GOWEN 1923). 
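
The figures in this section can be reproduced from the correlations quoted. The sketch below assumes the usual first-order partial correlation, with the mean correlation of .67 between trunk circumference and yield used for both years of a pair; the function names are hypothetical.

```python
from math import sqrt


def partial_correlation(r12, r13, r23):
    """Correlation of variables 1 and 2 with variable 3 held constant."""
    return (r12 - r13 * r23) / sqrt((1.0 - r13 ** 2) * (1.0 - r23 ** 2))


def variability_controlled(r):
    """Share of the variability controlled by a correlation r: 1 - sqrt(1 - r^2)."""
    return 1.0 - sqrt(1.0 - r * r)


# Yield in two years, with trunk circumference held constant.
r_yield = partial_correlation(0.57, 0.67, 0.67)
print(round(r_yield, 2))

for r in (0.57, 0.22):
    print(r, round(100 * variability_controlled(r)), "percent")
```

This gives a partial correlation of .22, and 18 percent and 2 percent of the variability controlled, as stated in the text.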


CONCLUSIONS 


This investigation indicates that trees which are small at an early age 
in the orchard are likely to remain relatively small, and will therefore 
be relatively unproductive, in subsequent years. The permanence of
differences in tree size is not due to known variable factors other than
seedling stocks. In our orchard the tree variability cannot be attributed
to soil differences, to bud mutation, or to known differential treatment. 
Undoubtedly many unknown factors are involved. It seems probable 
that differences in vigor of the stocks are the cause of at least some of 
the permanent differences in size of the young trees in the orchard. 

The results obtained, although not as comprehensive as desired, do
indicate that the larger grades of nursery stock will result in larger and 
more productive trees than the smaller grades of nursery stock of the same 
age. This conclusion is in accord with WEBBER’s (1920) recommendations 
in regard to citrus nursery stock. 

Although tree variability cannot as yet be directly attributed to variable
stocks, there is considerable presumptive evidence that stocks are an
important factor in causing permanent differences in size and productivity
of apple trees. Experiments are now in progress at the MAINE AGRICULTURAL
EXPERIMENT STATION to determine the effect of variable stocks,
eliminating all known variable factors.


INTRODUCTION

The inheritance of pericarp color has been studied by many workers
with maize. The ordinary red pericarp has been found to be a simple
dominant to colorless pericarp. EMERSON (1921), in his studies on general 
plant colors of maize, found that the red color was modified to brown by 
one of the recessive plant color factors. The factorial relations are as 
follows: 

P A = red
P a = brown
p A = colorless
p a = colorless

A series of pericarp types, including red, variegated, red-cob white,
orange and others, behaves essentially as a series of allelomorphs.


1 Paper No. 108, Department of Plant Breeding, CORNELL UNIVERSITY, Ithaca, New York.

Another type of pericarp color is a red or purple color frequently found
among the Indian maize of the West and Southwest. This type we have 
called cherry. In inheritance it has been found to be independent of 
red and related types. It occurs only on purple and dilute-purple plants. 
It is also modified to a brownish color by the recessive a factor.

In addition to the above, there are several pericarp colors which are not 
yet analyzed, such as the sun-red color of many varieties, and some of 
the pericarp colors found in South American varieties of maize. 

The present paper deals with data bearing on the factor relations in- 
volved in pericarp-color inheritance. It is to be followed by papers on 
the allelomorphism of the red pericarp series and on the mutability of 
the factor for variegated pericarp. 

NOMENCLATURE 

For the convenience of the reader, the factors and symbols used in 
this paper are given in the following list: 

P p, members of a series of allelomorphic or closely linked factors for 
red, variegated, red-cob white and other pericarp colors and color patterns. 

r^ch, one member of an allelomorphic series of factors affecting aleurone
and general plant colors. This allelomorph is necessary for the production
of cherry pericarp color.

R^r, R^g, r^r, r^g, other members of the same series. None of these produces
cherry pericarp color.

A a, B b, Pl pl, general plant-color factors, interacting in the following
manner (EMERSON 1921):

A B Pl, purple
A B pl, sun-red
A b Pl, dilute purple
A b pl, dilute sun-red
a B Pl, brown
a B pl, green
a b Pl, green
a b pl, green


PIGMENTS PRESENT IN COLORED PERICARP 


The red pericarp color is produced by a pigment which is almost
insoluble in water, alcohol, benzene and similar solvents, but which
dissolves in hot dilute sulfuric acid. Thin sections² of pericarp (15 to 25 µ)
are orange-red, or brick-red, in color. The brick-red pigment is characteristic
also of variegated, orange, and the other pericarp color types of the
same series. Brown pericarp has this pigment replaced by a yellowish one.

2 Microscopic sections have been prepared as outlined in a previous paper (ANDERSON 1921).

Cherry color is produced by a water-soluble purplish pigment, prob- 
ably the same as that found in the sheaths and leaves of purple plants. 
It is probably an extension of the same pigment into the pericarp. Cherry 
is found only on purple or dilute-purple plants. The brownish pericarp 
color appears likewise to correspond to the brown plant color which 
SANDO and BARTLETT (1922) have shown to be due to the flavonol, quercetin,
or its glucoside, isoquercitrin.

TABLE 1
Back-crosses of A p × a P with a p.

PEDIGREE       PARENT TYPES      RECOMBINATIONS
NUMBERS         A p     a P       A P     a p      TOTAL
E1494            25      35        31      38        129
E1497            17      22        21      24         84
E1514            22      30        40      28        120
E2019            22      31        23      37        113
E2021            35      28        31      29        123
E2074             7      10        15       9         41
E2099             8       7         6       2         23
E5105            13      15        13       7         48
E7288            28      32        29      28        117
E9084            24      26        37      23        110
E9085            32      38        36      35        141
Sub-total       233     274       282     260
Total               507               542           1049


FACTOR RELATIONS OF PERICARP COLORS 


In the following pages, data are presented bearing on the inheritance of 
red and cherry pericarp color, the interaction of factors for pericarp color, 
the independence of the factors for red and cherry, and the relation of 
these factors to other factors affecting pericarp color. The data presented 
are selected, partly for simplicity, and partly for their value in the study 
of the linkage relations of Mendelian factors of maize. 


Factor relations of red pericarp, P 


Relation to A 


Crosses of green plants having brown pericarp, a P, with dilute-sun-red
plants having colorless pericarp, A p, have given in F1 dilute-sun-red
plants with red pericarp. The F2 has given red, brown, and white (colorless)
pericarp, red occurring on dilute-sun-red plants only, brown on
green plants only, and white on both plant-color types. Similar crosses
in which one parent had brown pericarp and the other had orange,
variegated red, or white-capped red pericarp, have given similar results.
The data from back-crosses of these F1’s with the double recessive, a p,
are given in table 1.

A single F1 of A P × a p back-crossed with a p (pedigree E7291) gave
26 A P, 24 A p, 25 a P, and 27 a p. Of the 1151 plants of the twelve
back-cross progenies, 560 had the parental combinations of plant and
pericarp colors while 591 had recombinations of these characters, a
recombination percent of 51.3 ± 1.0. This indicates that the factors A
and P are independent (not linked) in inheritance.
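
The recombination percent and its error term can be reproduced with a minimal sketch. The text does not state how the ± value was obtained; the sketch assumes it is the probable error, taken as 0.6745 times the standard error of a binomial proportion, which was customary at the time. The function name is illustrative.

```python
import math

def recombination_percent(parental, recombinant):
    """Return (percent recombination, probable error in percent)."""
    n = parental + recombinant
    p = recombinant / n
    # Assumed convention: probable error = 0.6745 x binomial standard error.
    probable_error = 0.6745 * math.sqrt(p * (1 - p) / n)
    return 100 * p, 100 * probable_error

# Counts from the twelve back-cross progenies (table 1 plus pedigree E7291).
pct, pe = recombination_percent(parental=560, recombinant=591)
print(f"{pct:.1f} ± {pe:.1f}")   # 51.3 ± 1.0
```

A value within a few probable errors of 50 percent is what independent assortment would give, which is the basis of the conclusion drawn here.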


Relation to Pl


The plant-color factor Pl has no influence on the production of red
pericarp color. Data from back-crosses of F1 plants with the double
recessive are given in tables 2 and 3. Crosses in which each parent had
one dominant and one recessive factor are listed in table 2.

Crosses in which one parent had the two dominant factors and the other
the two recessive ones are given in table 3.

TABLE 2
Back-crosses of Pl p × pl P with pl p.

PEDIGREE       PARENT TYPES      RECOMBINATIONS
NUMBERS        Pl p    pl P      Pl P    pl p      TOTAL
A238             47      59        45      57        208
A241             32      28        31      37        128
A243             15      17        13      12         57
A248             33      41        32      42        148
A249             45      40        39      55        179
A250             47      23        36      28        134
E1514            10      22        17      12         61
E1564             5       1         4       7         17
E2019            16      10        13       6         45
E2021            24      16        15      10         65
E7297            29      23        25      21         98
E9094            25      25        25      32        107
Sub-total       328     305       295     319
Total               633               614           1247

The percent of recombination for the data in table 2 is 49.2 ± 1.0 and
for those in table 3 is 49.4 ± 1.2, thus indicating independent inheritance
of Pl and P.

Relation to R


Crosses involving red pericarp and the aleurone-color factor R, back-crossed
to the double recessive (white-pericarp “R tester,” p A C r), have
given the results recorded in table 4.

The recombination percent, 50.9 ± 1.1, indicates that P and R are
inherited independently.


TABLE 3
Back-crosses of Pl P × pl p with pl p.

PEDIGREE       PARENT TYPES      RECOMBINATIONS
NUMBERS        Pl P    pl p      Pl p    pl P      TOTAL
A511             55      57        62      47        221
A529             96     111        81      96        384
A654             31      47        42      61        181
E5105             9       5         8       4         26
Sub-total       191     220       193     208
Total               411               401            812


TABLE 4
Back-crosses of R p × r P with r p.

PEDIGREE       PARENT TYPES      RECOMBINATIONS
NUMBERS         R p     r P       R P     r p      TOTAL
A1819            91      77        67      92        327
A1825            38      38        43      48        167
E7297            21      21        38      18         98
E7299            21      17        22      19         79
E7301            52      38        44      43        177
E7345             8      13         8       7         36
E9098            31      21        26      30        108
Sub-total       262     225       248     257
Total               487               505            992


Linkages of P

KEMPTON (1921) has reported linkage between the factors for red
pericarp and brachytic culms. The percent of crossing over found was
about 37. EMERSON (1920) has reported a close linkage of red pericarp
color with tassel seed. Larger numbers have since been grown. There
is about one percent of crossing over. Linkage of red pericarp with
fine-striped leaf has also been found. The percent of crossing over is
about 35. The data concerning this linkage are to be published in a later
paper.

Factor relations of cherry pericarp 


Relation to R 


Crosses of cherry pericarp with purple aleurone (A C R) have been
back-crossed to R-testers (A C r). The F1 could not be used as the pistillate
parent of the back-cross, as cherry pericarp obscures the aleurone color.


TABLE 5
Back-crosses of cherry pericarp, r^ch × purple aleurone, R, with white-pericarp
R-testers, A C r.

                  PURPLE SEEDS             WHITE SEEDS
PEDIGREE       Cherry      White        Cherry      White
NUMBERS       pericarp   pericarp      pericarp   pericarp
A204               0          9             8          0
A276               0         14             7          0
A280               0         47            45          0
A452               0         29            22          0
A711               0         35            28          1
A713               0          7             5          0
A715               1         83            96          1
A717               1        243           227          3
A719               1        141           129          2
A1778              1         77            68          0
A1784              1         63            41          1
Total              5        748           676          8

When the reciprocal back-cross was made, the dominant R being brought 
in by the pollen only, the resulting seeds were mottled, as described by 
EMERSON (1918) and KEMPTON (1919). These mottled seeds were frequently
very light and perhaps even entirely white, making the classification
of aleurone color not entirely satisfactory. The distributions are
given in table 5. All cultures throwing sun-red or dilute-sun-red plants 
were discarded, as cherry pericarp occurs only on purple and dilute-purple 
plants. 

In general the purple seeds gave only white pericarp and the white 
seeds gave only cherry. Thirteen exceptions were recorded from a total
of 1437 back-cross plants. Progenies grown from open-pollinated ears
of the cherry exceptions gave no evidence of any recombination or crossing 
over having occurred. All of the exceptions were probably due to the 
difficulty of classifying the seed with respect to aleurone color. 

The test was repeated, making use of the effect of the R factor on plant
and anther color as well as on aleurone color. The results are given in
table 6.


TABLE 6
Back-crosses of cherry pericarp, r^ch × purple aleurone, R^g, with white-pericarp
R-testers, A C r^g.

                      PURPLE SEEDS                        WHITE SEEDS
PEDIGREE     Dilute-purple     Green plants,     Dilute-purple     Green plants,
NUMBERS      plants, cherry    white pericarp    plants, cherry    white pericarp
             pericarp                            pericarp
A1786               1                69                63                 7
A1790               1                69                64                 3
A1794               0                13                 9                 0
A1796               1                61                74                 6
A1804               1                40                39                 0
Total               4               252               249                16

The ears failed to mature as well as in the previous test, and aleurone 
color classification was more difficult. Twenty exceptions were recorded 
from a total of 521 plants. There were no recombinations of plant color 
and pericarp color. Of the four cherry exceptions, two were selfed and 
gave only colorless aleurone. Of the sixteen white-pericarp exceptions, 
twelve were selfed or crossed with R testers. All were heterozygous for 
aleurone color. Three others had open-pollinated ears throwing a large 
percentage of colored seeds. One plant failed to produce a good ear. 
Thus, of a total of twenty apparent exceptions, fourteen were proven not 
to be recombinations of aleurone and pericarp color, and three more gave 
strong indications of not being. The three untested plants were probably 
not different from those tested. 

As indicated above, recombinations of cherry pericarp with aleurone and
plant color (R factor) probably do not occur. We, therefore, assume that
the factor responsible for cherry pericarp is one member of the allelomorphic
series R^r, R^g, r^r, r^g for plant and aleurone color. This allelomorph we
designate by the symbol r^ch.


Relation to Pl


Crosses of cherry pericarp with sun-red or dilute-sun-red plants give
progenies which, in the F2, segregate for both plant and pericarp colors.
Among the purple and dilute-purple plants, cherry and white pericarps
occur in ratios approximating 3 to 1. All the sun-red and dilute-sun-red
plants have white pericarp. In a back-cross (A721) to the double recessive
the following numbers of individuals of the indicated color types were
obtained:

Dilute-purple plants with cherry pericarp      84
Dilute-purple plants with white pericarp       95
Dilute-sun-red plants with white pericarp     174
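
As a rough check on these back-cross numbers, the sketch below compares them with a 1:1:2 expectation. That ratio is not stated in the text; it is inferred here on the assumption that the cherry allelomorph and the purple plant-color factor segregate independently and that cherry pericarp requires both. The chi-square statistic is likewise not the authors' method, which relies on probable errors, and the function name is illustrative.

```python
def chi_square(observed, ratio):
    """Pearson chi-square of observed counts against an expected ratio."""
    total = sum(observed)
    parts = sum(ratio)
    expected = [total * r / parts for r in ratio]
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

# Back-cross A721: dilute-purple cherry, dilute-purple white, dilute-sun-red white.
observed = [84, 95, 174]

# Assumed expectation (an inference, not given in the text): 1 : 1 : 2.
chi2 = chi_square(observed, ratio=[1, 1, 2])
print(round(chi2, 2))   # 0.76
```

With two degrees of freedom the 5 percent point of chi-square is 5.99, so on this assumption the observed numbers agree well with the two-factor interpretation.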
TABLE 7
Back-crosses of R Pl × r pl with R-testers, A C r pl.

PEDIGREE       PARENT TYPES      RECOMBINATIONS
NUMBERS        R Pl    r pl      R pl    r Pl      TOTAL
E1555            14      17        18      28         77
E1557            14      14        15      15         58
E1561             5       9        10      13         37
E1570            13      12         9      12         46
E1572             9       9         6      12         36
E1575            24      24        24      24         96
E5301            16      16        19      16         67
E7297            32      24        37      16        109
E7299            22      23        33      35        113
E7344            13      20         8      19         60
E8259            10      10        15       8         43
Sub-total       172     178       194     198
Total               350               392            742

Since r^ch is one of the multiple allelomorphs of the R series, linkage or
independent inheritance of r^ch and Pl can be determined from crosses
involving Pl pl and the aleurone factor pair R r. Data from back-crosses
of R Pl × r pl with r pl are given in table 7 and of R pl × r Pl with r pl
in table 8.

The records of table 7 give 52.8 ± 1.2 percent, those of table 8 give
47.9 ± 1.9 percent, and those of the two combined give 51.2 ± 1.0 percent
of recombination. Obviously r^ch is inherited independently of Pl.


Relation to A 


The relation of cherry pericarp to the A factor may be illustrated by
culture A93 obtained by selfing a dilute-purple plant with cherry pericarp.
This culture gave six dilute-purple plants with cherry pericarp and one
green plant with “brownish” pericarp. The latter bred true and when
crossed with dilute-sun-red gave only dilute-purple plants with cherry
pericarp. Its constitution must have been a a Pl Pl r^ch r^ch. One of its
dilute-purple sibs selfed gave 22 dilute-purple plants with cherry pericarp
and 10 green plants with brownish pericarp. Other cultures have given
two types of green plants, the one type with brownish pericarp, the other
with white. Tests of these have shown the brownish-pericarp plants to
be of the constitution a Pl r^ch, and the white-pericarp plants to be lacking
either dominant Pl or the cherry allelomorph of R.


TABLE 8
Back-crosses of R pl × r Pl with R-testers, A C r pl.

PEDIGREE       PARENT TYPES      RECOMBINATIONS
NUMBERS        R pl    r Pl      R Pl    r pl      TOTAL
E8261            30      41        30      31        132
E8263            17      22        15      20         74
E8723B            2       7         8      10         27
E8725            15      10        15      10         50
E8729            14      11        11      12         48
Sub-total        78      91        79      83
Total               179               162            341

The three factors concerned here are independent in inheritance.
The independence of R and Pl has been shown above (tables 7 and 8).
That factors A and R and factors A and Pl are inherited independently
has been shown by EMERSON (1918, 1921).

The relation of these factors to plant and pericarp colors is as follows:

             Pericarp color   Plant color
A Pl r^ch    Cherry           Purple or dilute purple
A Pl r       White            Purple or dilute purple
A pl r^ch    White            Sun-red or dilute sun-red
A pl r       White            Sun-red or dilute sun-red
a Pl r^ch    Brownish         Brown or green
a Pl r       White            Brown or green
a pl r^ch    White            Green
a pl r       White            Green


DISCUSSION AND SUMMARY 


The pericarp colors which are of most frequent occurrence in North
American maize can be divided into two strikingly different categories:

1. A series of allelomorphic or closely linked types characterized by a
rather insoluble brick-red or orange-red pigment. These types differ
among themselves chiefly in the amount and distribution of this pigment.
The brick-red pigment is replaced by a yellowish one when the factor a
is homozygous, the distribution and relative intensity remaining the same.
The red pericarp pigment develops on sun-red, dilute-sun-red, purple and
dilute-purple plants, and the brown pericarp pigment on brown and green
plants. The characteristic factor P is independent of A, R, and Pl and
is linked with the factors for brachytic culm, tassel seed, and fine-striped
leaf.

2. Cherry pericarp color, due to the presence in the pericarp of the
water-soluble purple pigment (anthocyanin) found commonly in the
sheaths, leaves, anthers, aleurone, etc. Its development is determined
by two dominant factors, the factor r^ch and the purpling factor Pl for
plant color. The purple pigment is replaced by a yellowish pigment
when the factor a is homozygous. Since dominant Pl is required for the
production of cherry pericarp color, cherry cannot occur on sun-red or
dilute-sun-red plants. The factor r^ch is one member of the R series of
allelomorphs for aleurone and plant color and is consequently linked with the
factors for golden leaf and luteus seedlings. It is independent of P, Pl,
and A. The factor Pl is independent of R, P, and A, and is linked with
the factors for yellow endosperm and salmon silks.

The interaction of factors in the production of pericarp color in the two
series may be represented thus:

Red series:
P A            red
P a            brown
p A            white
p a            white

Cherry series:
r^ch Pl A      cherry
r^ch Pl a      brownish
r^ch pl A      white
r^ch pl a      white

Other allelomorphs of the R series give white pericarp only.
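
The two schemes above can be restated as a pair of small lookup functions. This is only an illustrative sketch: each argument records whether the dominant factor (or, for the cherry series, the cherry allelomorph of R) is present, and the function and parameter names are hypothetical. The two series are kept separate because the text does not describe the result of combining them.

```python
def red_series_color(has_P: bool, has_A: bool) -> str:
    """Pericarp color in the red series, from the presence of P and A."""
    if not has_P:
        return "white"
    return "red" if has_A else "brown"

def cherry_series_color(has_r_ch: bool, has_Pl: bool, has_A: bool) -> str:
    """Pericarp color in the cherry series, from the cherry allelomorph, Pl and A."""
    if not (has_r_ch and has_Pl):
        return "white"
    return "cherry" if has_A else "brownish"

print(red_series_color(True, False))            # brown
print(cherry_series_color(True, True, True))    # cherry
print(cherry_series_color(True, False, True))   # white
```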


INTRODUCTION


Scientific literature contains many accounts of abnormal and double
organisms which were produced by mutilations. Less often cases have
been reported in which the abnormalities were found in nature, or appeared
in mass cultures. Among the interesting reports of this kind might
be mentioned the work of BALBIANI (1891), involving Stentor; JENNINGS
(1908), Paramecium; MAST (1909), Didinium; DAWSON (1920), Oxytricha;
and many others. It will be observed that all of the above organisms
are ciliates possessing anterior and posterior regions. Thus, as a
rule, there was a tendency for normal individuals to be produced regularly
from one end of the dividing organism, so that the proportion of
normal to abnormal progeny became greater and greater. Furthermore,
it has been found that sooner or later monstrosities either became so
abnormal as to be unable to survive, or else the malformations were
gradually lost, so that in time all of the surviving animals were of the
normal type.


1 From the Department of Medical Zoölogy, School of Hygiene and Public Health, JOHNS
HOPKINS UNIVERSITY.

The organism under discussion in this paper differs somewhat from
those mentioned above in that it is a rhizopod, is not differentiated into 
anterior-posterior regions, and the shape of the offspring is more or less 
determined by the mould of the parent shell. 


THE PROBLEM 


While I was carrying on some experiments involving the shelled rhizopod
Arcella polypora Penard, a double individual suddenly appeared in a
Petri-dish culture of clone Q, line gd, which had been kept under observation
for five months.² This abnormal form was discovered on November
21, 1922, and immediately thereafter was isolated and transferred to
a depression slide containing hay infusion. Two days later it divided,
producing an offspring similar to itself. This aroused sufficient interest to
prompt a plan to ascertain how long, and to what
extent, the unusual characteristics would be inherited. This paper is an
attempt to set forth briefly the results of that investigation.


THE ORGANISM 


As pointed out above, the progenitor of the organisms used in this study 
suddenly appeared in a “‘pure-line” culture of Arcella polypora Penard. 
It is impossible to say what the stimulus was that caused the production 
of such an unusual form, for the line had been perpetuated by vegetative 
reproduction for over 150 generations, and during that time no marked 
abnormalities were observed. The culture in which this double animal 
appeared was a relatively young one, and since I have never observed 
conjugation among the individuals in such cultures, it seems very unlikely 
that the condition was brought about by a peculiar physiological state 
prior to, or during, syngamy. Furthermore, it is hardly probable that a 
general unfavorable condition could have prevailed in the culture medium, 
for there were many Arcellae in the same container, and all of the others 
were normal in appearance. 

A. polypora is described as possessing many nuclei, usually eight or ten.
Unfortunately the nuclei are partially, or completely, obscured by the cell
inclusions, so that it is usually impossible to determine how many are
present in a living individual; but by staining they can be counted easily.
The average diameter of the normal specimens of the line from which the
abnormal form was derived was approximately 100 micra. The size,
shape, and nuclear number of the aberrant specimen strongly suggest
that it was truly a double individual, produced, perhaps, by the failure of
two cells to separate during the process of fission. If such was the case,
however, it appears that separation occurred on one side of the oral aperture,
while on the other side the shell of each individual was absorbed and
the two sarcodes joined at this point, so that the protoplasm formed a
continuous mass from the distal end of one daughter cell to that of the
other (see figure 1, A, B and C).

2 This double form is mentioned, and a drawing of it is given, in a recent paper by the author
(REYNOLDS 1923).





FIGURE 1. Showing three views of the abnormal Arcella which was discovered on November
21, 1922. A, lateral view, organism on its back; B, latero-dorsal view, organism in process
of locomotion; C, dorsal aspect. (Camera lucida × 170.)


As can be seen in figure 1, A, the animal looked somewhat like a bivalve, 
the two ends folding on each other like the letter U. The mouth of the 
shell was also unusual in that it was ellipsoidal rather than circular. Loco- 
motion was accomplished in a half reclining position as shown in figure 1, 
B. When a new shell was formed, during the production of an offspring, 
its longer axis was at right angles to the long axis of the parent shell 
(see figure 2). Compare this with ROOT’s (1918) description of division
in Centropyxis aculeata, in which he shows the long axes of the two cells 
to be at an angle of 180 degrees to each other. 


FIGURE 2. Lateral views (A and B) showing positions of old and new shells during the division of
an abnormal Arcella, the old shell on top. B represents A turned through an angle of ninety
degrees. (Camera lucida × 170.)


METHODS 


The culture medium used in these experiments consisted of 10-percent 
hay infusion, prepared by placing ten grams of clean timothy hay in 
250 cc of distilled water and boiling it slowly for five minutes. Then one 
part of this solution was added to nine parts of distilled water.

When the original monstrosity appeared, it was isolated and designated
as P. Upon the appearance of the first offspring, the two organisms were 
separated and placed into fresh containers, the individual occupying the 
newly formed shell being called Pa. In this manner the animals were 
separated and named until there was a total of seven, from each of which 
a new clone was started. Considering P as the parent, the others were 
related to it as follows: Pa, Pb, and Pc were daughter-cells; Pa2 and Pa3 
were granddaughter-cells; Pa2a was a great granddaughter-cell. Draw- 
ings and measurements were made of these seven specimens. Then four 
of them (Pa, Pb, Pa2, and Pa2a) were placed in separate Petri dishes and 
permitted to multiply without being disturbed. The other three (P, Pa3, 
and Pc) were cultured in hollow-ground slides. As additional individuals 
appeared they were transferred to new concavities; in each case the strain 
was perpetuated through the specimen inhabiting the younger shell. The 
individuals remaining in the old shells were allowed to grow en masse
for several days, and then they and their descendants were preserved
in 70 percent alcohol. After a suitable period of time the organisms in 
the Petri-dish cultures were carefully examined, and then placed in 70- 
percent alcohol for future reference. 

After the experimental work had been completed, the preserved speci- 
mens were put into Petri dishes containing glycerine, the alcohol being 
removed by evaporation. This left the organisms in a viscous liquid.
Then, by using a fine glass filament, it was possible to place an individual 
in any desired position, which position would be maintained long enough 
to enable one to make camera lucida sketches of the object. 


EXPERIMENTS DEALING WITH METHODICAL SELECTION 


On November 29, 1922, after seven individuals had been obtained, as 
described in the preceding paragraph, animals P, Pa3 and Pc, were placed 
in concavities and designated as the progenitors of clones P, Pa3 and Pc, 
respectively. The results obtained for each of these clones were so nearly 
parallel that they will be discussed collectively. When the first offspring 
appeared, they were transferred to separate concavities containing fresh 
culture medium, while the “parent” organisms were left in the original 
containers until the fifth of December. By that time several individuals 
had been obtained from each of them. These were examined on that day 
and then preserved in 70-percent alcohol. The procedure of selecting the 
individual inhabiting the younger shell for perpetuating the clones, and 
keeping those in the older shells for a few days, and then preserving them 
and their descendants, was practiced regularly for two months. The purpose
of such a method is evident: first, it standardized the process; and
second, if the mould of the “parent” shell influenced the shape of the 
progeny, then it was reasonable to presume that the greatest variation 
could be obtained by selecting the specimen whose shell was most dis- 
tantly removed from the original animal. 

The total number of individuals obtained by the end of the sixty-day
period was as follows: for clone P, 1352; for clone Pa3, 4737; and for clone
Pc, 5182, or a grand total of 11,271. There was not a single normal
individual among this entire lot. On the other hand, examination of
the size and other morphological characteristics gave positive evidence
of a decided trend towards a return to
the normal condition in each of the clones. This tendency is very strikingly
illustrated in figure 3.


FIGURE 3. Lateral views of individuals representing different time intervals during the
breeding experiments with abnormal Arcellae. A, the original progenitor of clone P; A', a
descendant of P removed sixteen generations; A'', a descendant of P removed forty-two generations.
B, the original progenitor of clone Pa3; B', a descendant of Pa3 removed fifteen
generations; B'', a descendant of Pa3 removed forty-five generations. C, the original progenitor
of clone Pc; C', a descendant of Pc removed sixteen generations; C'', a descendant
of Pc removed forty-five generations. N, a normal individual drawn to the same scale.
(Camera lucida × 170.)

Another point of interest is brought out when we compare the fission
rate of the abnormal specimens with that of normal Arcellae. The
average rate of reproduction among normal individuals, under conditions
employed in these experiments, was approximately one generation per
day. During the first forty days in which the abnormal forms were cultured,
it was found that there were, on the average, about three divisions
in five days. From this time on, the abnormal specimens produced
approximately four and one-half offspring in five days. This sudden increase
corresponds closely with a decrease in size and degree of abnormality
on the part of the abnormal organisms. Figure 4 shows this graphically.
The rates of division exhibited in clones P, Pa3, and Pc were so nearly
parallel that a composite graph has been constructed for the three of them,
and it is compared with the division rate in a series of normal individuals
of the same line (clone Q, line gd) in which the double specimen appeared.
The graph is plotted so that each unit represents the number of divisions
during a period of five days.
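
The fission rates quoted above imply a definite number of generations over the sixty-day period, which the following sketch works out. It assumes, as a simplification, that each rate held constant throughout its period; the function name is illustrative.

```python
def generations(schedule):
    """Total divisions for a list of (days, divisions per five-day interval)."""
    return sum(days / 5 * per_five_days for days, per_five_days in schedule)

# Abnormal line: about 3 divisions per five days for forty days,
# then about 4.5 per five days for the remaining twenty days.
abnormal = generations([(40, 3.0), (20, 4.5)])

# Normal line: about one generation per day for sixty days.
normal = generations([(60, 5.0)])

print(abnormal, normal)   # 42.0 60.0
```

The figure of about 42 generations for the abnormal line agrees with the forty-two to forty-five generations by which the latest descendants drawn in figure 3 are removed from their progenitors.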














FIGURE 4. A comparison of the fission rates, five-day intervals, between a normal series N
(broken line) and a composite of three series of abnormal organisms A (unbroken line) over a
period of sixty days. Both the normal and abnormal series were descended from a common
ancestor. The rise in the unbroken line on the fortieth day corresponds with a decrease in the
size of the abnormal organisms.


Although no normal individuals were obtained in any of these clones 
during a period of sixty days, the results were such as to indicate that the 
organisms concerned were gradually approaching that state. These 
experiments were not continued beyond this point, because experience 
had demonstrated that the process would be a long, slow one; consequent- 
ly, they were abandoned in favor of other methods. 


SELECTING FOR NORMALITY FROM MASS CULTURES 


On December 21, 1922, two mass cultures were started by taking
individuals from clones Pa3 and Pc and placing them in Petri dishes containing
25 cc of 10-percent hay infusion. The purpose of these experiments
was to determine whether it was possible to obtain normal Arcellae from the
abnormal forms, by growing them en masse and selecting, each time new
cultures were made, those specimens showing the least degree of abnormality.
Upon examination on January 8, 1923, the Petri-dish cultures
were found to contain several thousand organisms. These were inspected
under a microscope, and the most normal-appearing animal was selected
from each group, for the purpose of “seeding” other cultures. Both
organisms thus chosen were found to be approximately the size of normal
individuals, although the curvature of the shell had not disappeared; the
depth of the notch in the animal from clone Pa3 was 11 micra, while in
the one from clone Pc it was 13 micra. Each of these organisms was
placed in a fresh container and allowed to multiply without interruption
until February 1, 1923. On that day the cultures were examined again,
with the result that about five percent of the organisms in each dish were
apparently normal in every respect, while others showed a greater degree of
abnormality than the animals, placed in the dishes on January 8, from which
they were descended. It is evident, then, from these experiments,
that normal Arcellae could be obtained from the abnormal variety;
yet the tendency to revert to the original type was not so noticeable
as one usually encounters. The time required to “breed out” the dual
characteristics which suddenly appeared was approximately seventy
days (from November 21, 1922, until the latter part of January, 1923).


SELECTING FOR GREATER ABNORMALITY FROM MASS CULTURES 


Having determined that normal individuals could be obtained from the 
abnormal forms by selection, a similar series of experiments was begun 
on February 1, 1923, but in this case the individuals exhibiting the greatest
degree of abnormality were chosen for perpetuating the strain. The
animal selected as the progenitor of this strain showed about the same 
degree of abnormality as was shown by the first double form that ap- 
peared. Fourteen days later the culture was examined, and the most 
abnormal form found was transferred to a Petri dish containing fresh 
culture medium. An examination of this culture on February 28 revealed 
many very abnormal specimens, a large percentage of which were clumped 
together in groups of twos and fours, as if they were unable to completely 
disentangle themselves after fission had taken place. The most unusual 
specimen found in this culture was isolated, and transferred to a depres- 
sion slide. After making several drawings of the organism, the slide 
containing it was placed in a moist chamber. The animal lived in this 
condition for two days, but no progeny was produced. Further experiments
along this line were not attempted, since it was considered
improbable that such a form could live very long. Figure 5 will show 
to what extremes the abnormal characteristic was carried. 




Figure 5.—Showing lateral views of abnormal organisms obtained by selecting for that 
characteristic from mass cultures. A, the organism with which the strain was started on February 
1, 1923. B, the most abnormal form found in the culture on February 15. C, the most abnormal
organism found in the culture on February 28. This specimen, which lived but two days
after being isolated, was the most abnormal animal that was encountered in the entire work.
(Camera lucida × 170.)


RETURN OF DOUBLE CHARACTERISTICS 


It has been shown in a previous part of this paper that normal organisms 
could be obtained from the abnormal forms. The question naturally 
arises: Do these normal individuals permanently retain their normality,
or is there a tendency for some of them to inherit the peculiar structural
characters of their double ancestors? In order to answer this query a
number of Petri-dish cultures were made from normal Arcellae that had 
been obtained by selection, as described above. The stock cultures, from 
which the mass cultures were “seeded,” were carried in depression slides,
the organisms being transferred to fresh medium every other day. After 
a period of twenty days the cultures were examined under a Greenough 
binocular (No. 10 eye-piece and 32 mm objective). The animals in each
dish were either counted or carefully estimated, and the number
of specimens showing a noticeable degree of abnormality was recorded.
The results of these experiments are shown in table 1. It is possible that 
those cultures which did not produce any abnormal forms in the period of 
twenty days, would have yielded double organisms had the observations 
been extended over a longer period of time. That point, however, was 
not considered of sufficient importance to justify further experimentation 
on the subject. 

TABLE 1

EXPERIMENT   DATE CULTURES   NUMBER OF SPECIMENS       NUMBER OF DOUBLE
             WERE STARTED    OBTAINED AFTER 20 DAYS    ORGANISMS

A            March 1          9,500                      0
B            March 1         12,000                    126
C            March 1          4,500                      0
D            March 1          7,200                     18
E            March 3          2,600                      0
F            March 5         15,000                      0
G            March 5            350 (died)               0
H            March 7          5,800                      0
I            March 21            12 (died)               0
J            March 21         6,000                     34
Total                        62,962                    178


MASS CULTURES


As pointed out in the section dealing with methods, four of the first
seven organisms that appeared (Pa, Pb, Pa2, and Pa2a) were cultured en
masse by placing them in Petri dishes containing 10-percent hay infusion.
These cultures were started on November 29, 1922, and maintained for
twenty-one days. On December 20 they were examined, counted, and
then preserved in alcohol. Following is the number of organisms found
in each dish, and the remarks made regarding them the day they were
examined:

Clone Pa yielded 263 organisms; there were no normal specimens among 
them, though some were much less abnormal than their clonal parent. 
They showed a great variation in size, some being twice as large as others. 

Clone Pb yielded 2500 organisms; size variation great; no normal forms 
present. 

Clone Pa2 yielded 1047 organisms; marked tendency towards normal 
condition evident; no normal forms present; size fairly uniform as com- 
pared with clone Pa. 

Clone Pa2a yielded 74 organisms all of which were double; culture died 
previous to the examination. 


CORRELATION BETWEEN SIZE AND ABNORMALITY 


In an early part of this paper, size and shape were mentioned as two 
important features which indicated that the organisms under consideration
were double. The large folded appearance of very abnormal animals,
and the gradual decrease in size associated with a corresponding decrease 
in the depth of the notch produced by the folded ends, seemed quite ap- 
parent; but it was deemed advisable to apply mathematical proof before 
accepting this as an established fact. In order to do this, it was necessary 
to find some unit of measure which could be applied to a large number of 
organisms. The method adopted was as follows: The organisms obtained
during the first 21 days from mass cultures of clones Pa, Pb, Pa2,
and Pa2a, and those obtained during the first 60 days from depression-
slide cultures of clones P, Pc, and Pa3, were placed in Petri dishes
containing glycerine. The bottoms of the dishes were marked off into squares
and each square was numbered. Then, approximately 100 individuals 
from each clone were selected at random, turned on their sides, and 
drawn (by means of a camera lucida) to a scale of 170. The geometrical 
figures employed in getting the units of measure are shown in figure 6. 


Figure 6.—A represents a lateral view of a normal Arcella. If the figure N is revolved
around the axis a b through an angle of 180 degrees, a solid will be generated, the volume of
which will approximate the volume of the organism. The ventral boundary of N parallels line x y, 
consequently, no notch is formed; therefore, the degree of abnormality in this case equals zero. 
B represents a lateral view of an abnormal Arcella. If this is resolved into two figures N and 
N’, and N revolved around a b and N’ around a’b’ through an angle of 180 degrees, two solids will 
be generated, the sum of the volumes of which will be nearly equivalent to the volume of the 
organism. The ventral boundary of this figure bulges up in the center, therefore the distance d 
represents the degree of abnormality present. 

A camera-lucida outline of an organism, as seen from the side, enables 
one to measure both the area (a) of the figure (which is nearly equal to a 
vertical section of the organism) and the depth of the notch (d). As 
pointed out in the legend to figure 6, it is possible to resolve such an out- 
line into two geometrical figures, which, if revolved around their respective 
axes through an angle of 180 degrees, will produce two solids, the sum of 
the volumes of which will approximate the volume (size) of the animal. 
Therefore, the area of these figures furnishes a unit which can be utilized 
in determining the size of the organisms. The areas of the drawings were 
obtained, in square micra, by means of a planimeter; then the results 
were divided by 170² in order to get the actual areas of vertical sections
of the organisms concerned. 
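
The geometrical idea behind this unit can be sketched numerically. Revolving a full symmetrical outline through 180 degrees about its central axis sweeps out the same solid as revolving one half of the outline through 360 degrees, so the volume is the integral of pi times the squared half-width along the axis. The following is only an illustration under stated assumptions: the function name and the dome-shaped profile (a hemisphere of radius 50 micra) are hypothetical and are not measurements from this work.

```python
import math


def volume_of_revolution(heights, radii):
    """Approximate the volume swept out when a symmetrical outline is
    revolved about its central axis.

    heights: sample positions along the axis (one length unit throughout)
    radii:   half-width of the outline at each sample position
    The integral of pi * r**2 is taken with the trapezoidal rule.
    """
    if len(heights) != len(radii) or len(heights) < 2:
        raise ValueError("need at least two matching samples")
    total = 0.0
    for i in range(1, len(heights)):
        step = heights[i] - heights[i - 1]
        total += 0.5 * step * math.pi * (radii[i] ** 2 + radii[i - 1] ** 2)
    return total


# Hypothetical dome-shaped outline: a hemisphere of radius 50 micra.
R = 50.0
n = 2000
hs = [R * i / n for i in range(n + 1)]
rs = [math.sqrt(max(R * R - h * h, 0.0)) for h in hs]
dome_volume = volume_of_revolution(hs, rs)

# Exact volume of a hemisphere, for comparison with the numerical result.
exact_volume = 2.0 / 3.0 * math.pi * R ** 3
```

For an abnormal outline resolved into two figures, the same function would be applied to each figure about its own axis and the two volumes added.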

As shown in figure 6, A, the ventral margin of a normal Arcella’s shell, 
as seen from the side, is straight; therefore, if, when a line is drawn along 
the ventral side of a figure, it parallels the ventral margin of the shell, 
the organism in question is normal. When the ventral edge of the shell 
bends upwards in the center, it indicates that the organism showing this
peculiarity is not normal in that particular respect; the deeper the notch 
formed by the upward bending, the greater is the degree of abnormality 
exhibited. The degree of abnormality was obtained by measuring the 
length of d (figure 6) and dividing the result by 170, thus giving the actual 
distance in micra. 
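
The two reductions described above (drawing area divided by 170², drawing length divided by 170) can be written out as a small sketch. The magnification of 170 is taken from the text; the function names and the sample drawing values are hypothetical and serve only to show the arithmetic.

```python
MAGNIFICATION = 170  # camera-lucida scale stated in the text


def actual_area(drawing_area):
    """Convert an area measured on the drawing to the actual area.
    Areas scale with the square of the linear magnification."""
    return drawing_area / MAGNIFICATION ** 2


def actual_depth(drawing_length):
    """Convert a length measured on the drawing (such as the notch
    depth d) to the actual length."""
    return drawing_length / MAGNIFICATION


# Hypothetical drawing measurements, expressed in micra and square micra.
example_area = actual_area(289_000_000)   # 10,000 square micra
example_depth = actual_depth(2040)        # 12 micra
```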

Six hundred and ninety individuals were selected at random and
measured for a and d in accordance with the method described above,
and from the data thus obtained a correlation table was constructed as
shown in table 2. The correlation between size and abnormality was
found to be .460 ± .020. Thus, a significant relation between these two
elements was established, which, of itself, is sufficient to justify the former
contention regarding the dual nature of these organisms. Aside from the
correlation between size and abnormality, the measurements showed a
wide range of variation for each factor, and demonstrated also that all
of the abnormal specimens encountered were larger than the average
normal Arcellae from which they were descended. This is shown
graphically in figure 7.

TABLE 2

Correlation table for size and abnormality in 690 individuals. The unit of measurement for the degree
of abnormality is 6 micra, and for size 345 square micra. Correlation .460 ± .020.

Size
10 11 12 13 14 15 16 17 18 19 20 21

Abnormality
8    10 4 3 2 1 1 | 21
14   4 14 30 46 13 10 9 7 5 | 138
20   2 13 46 66 46 21 6 7 3 2 2 | 214
26   6 29 28 40 20 15 12 5 4 1 1 | 161
32   3 18 10 14 6 9 6 1 1 3 1 | 72
38   2 3 6 3 7 4 1 1 1 | 28
44   3 4 6 5 1 1 | 20
50   1 ae 2 | 12
56   2 2 4 2 1 | 11
62   1 1 2 3 | 7
68   1 2 1 1 | 5
74   1 | 1
     16 40 128 156 122 69 62 52 18 15 a | 690

Figure 7.—Curves showing the distribution of abnormality (broken line) and area (unbroken
line) in 690 specimens selected at random. The ordinates represent the number of
individuals. In the abscissae, zero equals the depth of the notch (d) and also the area (a) of a
vertical section of a normal Arcella; each division represents 6 micra for d, and 345 square micra
for a.
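
The text does not state how the coefficient or its ± term was computed. As a minimal sketch, assuming the usual product-moment coefficient taken from a two-way frequency table, and assuming the ± term is the probable error 0.6745(1 − r²)/√n that was conventional at the time, the calculation might look as follows. The function names and the small toy table are hypothetical; the toy table is not the data of table 2.

```python
import math


def grouped_correlation(table, x_values, y_values):
    """Product-moment correlation from a two-way frequency table.

    table[i][j] is the number of individuals in row class y_values[i]
    and column class x_values[j].
    """
    n = sum(sum(row) for row in table)
    sx = sy = sxx = syy = sxy = 0.0
    for i, row in enumerate(table):
        for j, count in enumerate(row):
            x, y = x_values[j], y_values[i]
            sx += count * x
            sy += count * y
            sxx += count * x * x
            syy += count * y * y
            sxy += count * x * y
    covariance = sxy / n - (sx / n) * (sy / n)
    var_x = sxx / n - (sx / n) ** 2
    var_y = syy / n - (sy / n) ** 2
    return covariance / math.sqrt(var_x * var_y)


def probable_error(r, n):
    """Probable error of a correlation coefficient (assumed convention)."""
    return 0.6745 * (1 - r * r) / math.sqrt(n)


# Hypothetical toy table with every individual on the diagonal.
toy_table = [[5, 0],
             [0, 5]]
toy_r = grouped_correlation(toy_table, x_values=[10, 11], y_values=[8, 14])

# Probable error for the reported coefficient and sample size.
pe = probable_error(0.460, 690)
```

Under this assumption the probable error for r = .460 and n = 690 comes to about .020, which agrees with the figure reported in the text.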


DISCUSSION 


Although no other paper dealing entirely with the inheritance of ab- 
normal characteristics in Arcella has ever been published, it is of interest 
to note that other investigators have found forms in nature similar to 
those under discussion. LEIDY (1879) and HEGNER (1919) have shown
pictures of oval specimens with elongated oral apertures and bent shells.
HEGNER cultured some of these abnormal Arcellae and found that, under
laboratory conditions, they quickly reverted to a normal shape. He
concludes, therefore, that: 


“Since the descendants of these bent, oval specimens were normal and since 
no specimens of this sort have appeared among the thousands reared in the 
laboratory, it seems safe to conclude that some environmental condition is 
responsible for this peculiarity and that as soon as the controlling factor is 
removed, the normal characteristics are regained.” 


Since HEGNER was dealing with the same species (A. polypora), there 
are but three fundamental points of difference between his experiments 
and those incorporated in this paper. They are as follows: The organisms 
with which HEGNER was dealing (1) were found in nature, (2) were trans- 
ferred to different surroundings and cultured in pond water, and (3) 
showed only a slight curvature of the shell; while the organisms discussed 
in this paper (1) are descended from a single ancestor which appeared in a 
“pure-line” culture, (2) were kept amidst the same surroundings and 
cultured in 10-percent hay infusion, and (3) showed a very decided 
curvature of the shell. The results obtained in the two investigations 
are totally different: HEGNER found that all traces of abnormality disappeared
within a few generations; in these experiments the abnormal
characteristic is just as noticeably present now (six months later) as it 
was in the beginning. It seems most reasonable to make use of a part of 
HEGNER’S conclusion in explaining the probable cause of the production 
of the original monstrosity, namely, that some environmental condition 
was responsible for this peculiarity. Yet, at our present stage of knowl- 
edge, it is impossible to make an analysis of the conditions and say which 
factor, or combination of factors, was responsible for the outcome. 
Certainly the agency must be a very subtle one, for the abnormality ap- 
peared in only one specimen, while there were hundreds of others in the 
same container. 

The second part of HEGNER’s conclusion, that “as soon as the controlling
factor is removed, the normal characteristics are regained,” is perhaps
equally true; but in this case it seems better to modify the statement 
somewhat so that it will read: as soon as the controlling factor is removed 
there is a tendency for the organism to revert to the normal type. This is in 
line with the conclusion drawn by Dawson (1920) from his work on 
Oxytricha, in which he states: 


“It is therefore evident that the environment, although it was probably
the principal factor in bringing about the pseudoplastogamic union, was not
responsible for the continuance of the state of protoplasm necessary in such
union.”


Further on in the same article, DAWSON suggests that the intimate union
of the protoplasm of each component with the consequent possibility 
of protoplasmic interchange was responsible for the permanency of the 
twin condition found in the organisms with which he worked. 

To what extent this summarization can be applied to the double Arcella 
is a question. As previously stated, all of the evidence indicates that the 
organisms under consideration were dual in nature. Yet, the single buccal 
opening, the unbroken continuation of cytoplasm, and the possibility 
of nuclear wanderings (as shown by HEGNER 1920), make it more prob- 
able that these apparently double forms were physiologically comparable 
to a single individual. Furthermore, the author has shown (REYNOLDS 
1923) that the reaction of these abnormal animals to pseudopodial frag- 
ments of normal Arcellae was not unlike that of normal organisms. 
What, then, is the mechanism by which this abnormal characteristic was 
perpetuated? 

As JENNINGS (1908) has pointed out:


“When a given structural characteristic arises during the reproductive
processes so as to appear in a given generation, that is not because the
same structure was present in a preceding generation.”


Evidently, then, it is an acquired modification, produced by some change 
in the constitution. It is, perhaps, an internal reaction to an external 
stimulus. This phenomenon observed in Arcella seems to be different 
from the surface distortions observed in Paramecium by JENNINGS (1908) 
and STOCKING (1915), or the peculiar physiological conditions found in
Paramecium by JENNINGS (1908) and STOCKING (1915), and in Oxytricha
by Dawson (1920). 

As JENNINGS (1908) has shown, when a superficial structure (such as a 
spine) occurs in Paramecium it is passed on to only one of the daughter 
cells. In this sense, then, it behaves as a somatic acquisition. When the 
physiological condition of the organism is so altered as to produce a 
tendency for the dividing cells to remain attached to each other, the 
phenomenon seems to be one of true inheritance, since the condition is 
passed on to all (or most) of the offspring alike. In the case of double 
and multiple organisms, the conditions present seem to be referable to 
the above explanation. Professor S. O. Mast informs me that he has 
observed, in his cultures of Didinium nasutum, monstrosities consisting 
of many individuals fused caudally, giving rise to normal individuals 
from the free ends. In this, as in many other similar instances, the 
tendency to inherit the peculiar physiological state seems to disappear 
quickly. In practically all cases where double organisms have occurred,
the well-being of such forms has been impaired by their inability to move 
around as freely as the normal individuals of their species; consequently, 
they are doomed to extinction by their inability to secure sufficient food, 
or to escape from enemies. 

In discussing the inheritance of the abnormal characteristics observed 
in these Arcellae it will be helpful to point out some of the differences 
between this form and other forms which have been studied: (1) it is a 
rhizopod, possessing less-differentiated structures than the ciliates; (2) a 
normal organism possesses many nuclei (which, so far as we know, are 
undifferentiated) which move around within the cytoplasm; (3) a normal 
animal crawls around on the substratum and is protected from some 
enemies by its hard shell; consequently, the only disadvantage entailed by 
a doubling up of the bulk would be a larger volume to be nourished
through one entrance; but this is cared for, in part, by a larger mouth 
opening and greater pseudopodial extension; (4) the shape of the parent 
shell is such that it mechanically influences the shape of the offspring. 
When we consider the results obtained in light of these facts, it does not
seem unusual that they should differ from those obtained by others. 
The conditions present here seem to approximate the requirements 
formulated by JENNINGS (1908, page 630), as being necessary for a new 
structure b to be inherited, namely:

“(1) it must directly produce the structure b; (2) it must produce some permanent
change a in the constitution of the cell, such as will so modify the
processes of reproduction that they in their turn will produce some structure b.”

The graph in figure 4 will show that the large, abnormal organisms 
reproduce at a much lower rate than their normal relatives. As they 
decreased in size the fission rate became greater, so by the time their 
volumes approximated that of normal individuals, their rate of reproduc- 
tion very nearly equaled that found in normal animals. Likewise table 2 
shows a significant correlation between size and the degree of abnormality 
exhibited. In a similar manner, it might have been shown that there was 
a positive correlation between size and nuclear material. GRUBER (1912), 
working with Amoeba, and HEGNER (1920), working with Arcella, have 
shown that a very definite nucleo-cytoplasmic relation exists; therefore, 
it was considered unnecessary to ascertain this point. 

It is interesting to note, in connection with this work, that HEGNER 
(1920) cut Arcellae into two parts, each of which contained one or more 
nuclei. These pieces gave rise to normal animals which were smaller 
than the individuals of their ancestral line, and usually they contained
fewer nuclei. This condition was only temporary, however, for the
ancestral size and nuclear number were soon regained. In Arcella dentata
this was accomplished by the organism throwing off an empty shell; 
while in A. discoides and A. polypora it was brought about by a doubling 
of the nuclei, without being followed by division. In the same article 
HEGNER also states that a difference in the number of nuclei among the 
individuals of a given line was brought about by an uneven division of 
the nuclei at the time of fission. These two facts: (a) the doubling of 
nuclei under certain conditions, and (b) the unequal distribution of nuclei 
between the two daughter cells, probably furnish a key to the solution of a 
part of this problem. They enable one to speculate as to how the first 
monstrosity attained its size, and also why the progeny varied in both 
directions regarding size and abnormality. The portion which still 
remains a query is the agency responsible for initiating the tendency for 
such a peculiarly shaped shell. 


SUMMARY AND CONCLUSIONS 


1. On November 21, 1922, a very abnormal individual was found in a 
Petri-dish culture of Arcella polypora Penard. The line in which it ap- 
peared had been cultured in a pure state for the preceding six months, 
and during that time no abnormalities had occurred. Although several 
hundred related organisms were in the same dish, only one showed the 
aberrant characteristic. 

2. The size, shape, and nuclear number of the abnormal specimen 
indicate that it was a double organism, produced, perhaps, by the failure 
of two cells to completely separate during the process of fission. 

3. Upon isolating this form and culturing it in depression slides, it 
was found to breed true, all of the progeny produced during the first sixty 
days (about 45 generations) showing the abnormal characteristic. 

4. The fission rate of the abnormal organisms was, on the average, 
about three-fourths that of normal Arcellae. 

5. By growing the abnormal organisms in mass cultures, and selecting 
the least abnormal specimens each time new cultures were made (at 
intervals of two weeks), it was possible to obtain normal individuals 
from them in about seventy days. 

6. By growing the abnormal organisms in mass cultures, and selecting 
the most abnormal specimens each time new cultures were made, it was 
possible to emphasize the degree of abnormality. Within the period of 
one month such very unusual forms were obtained that it was considered
unnecessary to carry the experiments farther. 

7. Ten mass cultures were made of normal individuals which had been 
obtained from abnormal ancestors. After a period of twenty days three 
of the cultures contained some abnormal specimens. The total number 
of Arcellae obtained from these cultures was 62,962, and of that number 
178 were abnormal, or about 0.28 percent.

8. The correlation between size and abnormality for 690 individuals 
selected at random, was found to be .460 ± .020.
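
The totals quoted in conclusion 7 can be checked against the counts given in table 1. The sketch below simply re-adds those counts and expresses the double organisms as a percentage of all specimens; the variable names are illustrative only.

```python
# Specimens obtained after 20 days, per experiment (table 1).
specimens = {
    "A": 9500, "B": 12000, "C": 4500, "D": 7200, "E": 2600,
    "F": 15000, "G": 350, "H": 5800, "I": 12, "J": 6000,
}

# Double organisms found, per experiment (table 1).
doubles = {
    "A": 0, "B": 126, "C": 0, "D": 18, "E": 0,
    "F": 0, "G": 0, "H": 0, "I": 0, "J": 34,
}

total_specimens = sum(specimens.values())
total_doubles = sum(doubles.values())

# Share of double organisms, as a proportion and as a percentage.
proportion_double = total_doubles / total_specimens
percent_double = 100 * proportion_double

# Number of cultures in which any double organisms appeared.
cultures_with_doubles = sum(1 for count in doubles.values() if count > 0)
```

The proportion is about .0028, which corresponds to roughly 0.28 percent of the specimens, and three of the ten cultures contained double organisms.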